Hope from the Pope


The Vatican has produced a timely and valuable warning on the threat of
climate change that will reach a wide audience.

Ties between the Catholic Church and science have long been ambivalent.
Famously, it took the Vatican 359 years to formally concede that Earth
moves around the Sun: only in 1992 was Galileo Galilei, who proclaimed
that fact in 1633, officially ‘rehabilitated’ by Pope John Paul II.

It is gratifying that Catholic officials have moved on. Pope Francis’s
much-anticipated encyclical letter on climate and the environment,
released last week with great fanfare, is of course pastoral in tone and
intent. And yet, the papal letter testifies to the respect and honest
understanding that the Argentinian Pope and his cardinals have for science.

“Science is the best tool by which we can listen to the cry of the Earth,”
Cardinal Peter Turkson, president of the Pontifical Council for Justice
and Peace, said on releasing the 192-page letter, drafted in consultation
with leading climate experts. While calling on leaders to step up efforts
to reduce global greenhouse emissions (which the encyclical acknowledges
to be the main cause of warming), the Pope warns in strong language that
unabated climate change will cause severe disruptions to ecosystems and
human societies. His letter to the world’s more than one billion Catholics
repeatedly stresses the moral imperative for action on global warming,
poverty and overexploitation of natural resources.

With international climate negotiations entering a decisive phase, the
Vatican has chosen the right time to weigh in. Combined with a pledge
earlier this month from the G7 leading industrialized nations to phase
out fossil fuels, hopes are increasing that the next round of global climate
negotiations, in Paris in December, might reach a meaningful deal.

Scientists and political leaders in favour of climate action have
rightly expressed gratitude and admiration for Pope Francis’s brave
move. In doing so, President Barack Obama said that the United States
must be a leader in efforts to cut carbon pollution and protect the
environment. Other nations must follow suit. It will take more than
spiritual aid to set the course for a clean-energy future, but the
political weight of the Pope's appeal to the moral conscience and reason of
millions of people must not be underestimated.

The fact that the Vatican has sided with those who criticize the
excesses of capitalism and consumerism (systems that the Pope labels
in the encyclical as culprits in the world’s ecological crisis) could fuel
suspicion in some quarters. But his plea for moderation and sustainability
should not be dismissed as naive or utopian. The long-held view that
continuous economic growth produces perpetual happiness and well-being
is increasingly being questioned, including by forward-looking
economists and social psychologists. And the financial crisis of recent
years has produced ample evidence of the risks of runaway capitalism.

Wisely, Pope Francis has framed the issue of global change along the
lines of hope and opportunity, rather than of doom and gloom. Why, he
asks, should humankind take the risk of dangerously altering the planet's
climate and biosphere when there are viable alternatives?

The papal calls to end poverty and share the world’s ecological space
in a fair way are objectives that mirror the United Nations’ Sustainable
Development Goals, to be released in September. The Pope's letter
adds an important facet to the discussion: it is not merely conceivable
to secure a sound future for human civilization without relying on
coal, oil and gas; it is a prerequisite.

Given the Pope’s moral authority and sky-rocketing popularity, not
just among Catholics, his words might travel farther than sober
scientific reports by bodies such as the Intergovernmental Panel on
Climate Change (IPCC). Those tomes tend to brim with jargon and are read
by relatively few people. And unlike the IPCC, which sometimes struggles
with public outreach, the Catholic Church has a superb means of
spreading the word every Sunday. A global campaign that the church plans
to run in coming months could step up the urgency for the
December climate talks. To build useful momentum, climate policies
must be top of the agenda when Pope Francis visits the White House
and addresses the UN General Assembly in September.

Meanwhile, within the encyclical are other issues that are impor- 
tant for science. Remarkably, the Pope concedes that intervening with 
plants and animals is “permissible when it pertains to the necessities 
of human life”. The clause signals a rethink of the Catholic Church's 
attitude to genetic engineering and animal research. Alas, he remained 
silent on issues of contraception. With a world population heading 
towards a possible 10 billion, the importance of family planning is 
clear. The Vatican has been brave on climate change. If it is serious 
about the fate of the planet and the welfare of its inhabitants, then it 
must be braver still on the issue of contraception.


Data overprotection 


Draft European rules governing privacy threaten to hamper medical research.

When officials at the European Commission proposed new data-protection
rules in 2012, the prospects for science looked good. Three years on, that
optimism has been replaced by concern. The best that researchers can hope
for from the rules now, it seems, is that they do not make things worse.
For, as politicians continue to try to protect the individual in a digital
world, they risk inflicting major long-term damage on the research
environment.

The rules aim to update the 1995 data-protection regulations
to reflect the reality of the digital age, in which information about
individuals is increasingly being used as a commodity.
A pan-European law on how data could be sourced, stored
and used would, in theory, be good for research. Greater harmoniza- 
tion could smooth the difficulties that scientists face when they try 
to pool analysis of genomic data and tissue samples across national 
borders. Such sharing could help to organize powerful trials with 
large numbers of participants. But it is held back at present because 
different European countries have their own rules on issues such as 
informed consent, or on how to anonymize or pseudonymize data. 

Unfortunately, as a result of their reworking, the draft rules threaten 
to place further restrictions on such studies, by tightening the way 
that samples and data can be used. The situation is so perilous that 
researchers met in Brussels last week for a “Day of Action on Data
for Health and Science”, to explain to policy-makers “the reasons
why personal data is necessary to scientific research, including 
medical research”. 

The reasons might sound obvious to scientists, and the need to 
explain them to politicians may seem distressing. But there is no
anti-science agenda at play here. No one had intended to make medical
research more difficult. The trouble is that such research is only a 
small part of the legislative issue. And discussions all take place in 
ministries of justice and other places where grasp of the mysterious 
world of science is relatively weak. 

There is hope, in other words, that the feared damage can be avoided 
when the draft rules are finalized, possibly by the end of this year, as 
long as enough people make enough fuss to attract enough attention. 
To their credit, science funders and lobby groups across Europe have 
been doing just that. 

The regulations proposed in 2012 by the commission allow the use
of personal data only after an individual has given “specific, informed
and explicit” consent. They are, after all, directed primarily at potential
misuse of data gathered at websites or apps such as Facebook. But they
made exceptions for health research, recognizing that such
work depends on broad models of consent in which participants
can agree for their data to be used in a variety of research studies,
and that, subject to suitable safeguards, the data can be stored
for future use.

Parliamentary amendments agreed last year changed the emphasis,
and in doing so removed the exemption for research. If accepted,
the amendments would require consent from participants for each
new study. That would impede research. It is something that patient
groups say they do not want if they donate data or tissue to a
trusted party, such as established biobanks. And the evidence suggests
that participants are less likely to give re-consent the more
frequently they are asked for it.

Talks on the draft rules were due to restart this week, with 
trialogue negotiations scheduled between the European Commission, 
Parliament and Council. The council, at least, seems to be aware of the 
unwitting threat to research and wants to do something about it. Its 
statement on the issue, published last week, indicates that it will argue 
for the research exemption to be maintained. 

Why should science be seen as a special case? A rigorous govern- 
ance and regulatory framework for biomedical research already exists 
and operates under national and international laws. As this journal 
has chronicled, the ‘big data’ era poses new challenges, to privacy for 
example. But from a scientific standpoint, it makes more sense to 
update these existing rules as required than for valuable research to 
be threatened by overarching laws aimed at a different problem. 

The European data-protection drive is built on the principle 
of “one continent, one law, one regulation”. But even the most 
committed believers may have to accept that biomedical research must 
remain an exception — and be granted an exemption — if it is to 
continue to thrive.


Life under the ice 


Antarctica’s apparent barrenness hides an 
abundance of living organisms. 


Many of our views of the continent of Antarctica are influenced
by the words of the great polar explorers. Robert
Falcon Scott, a man ultimately drawn to his death there,
famously wrote of the Antarctic, “Great God, this is an awful place”.
Present-day adventurers, from those who follow in the footsteps of 
Scott to the interior to tourists skirting around the edge, also trade on 
the sheer other-worldliness of Earth’s southern cap. The more remote 
Antarctica is, the more their sense of achievement. Head there by ship 
and — if you can stand the swell — you can peer over the side to watch 
the seawater change colour at the convergence point where cold cur- 
rents circling the continent push up against the warmer waters of the 
southern Pacific Ocean. 

To stand on Antarctica is truly like nothing else on Earth. On a still
day, it is as if time itself has frozen. Senses, the essential guides to life 
on our planet, are almost redundant. There is no movement, no sound 
and no smell. Scientists who spend time there have been known to 
take curry powder on their field expeditions, to prepare spicy food 
to compensate for the lack of other stimulation. With centuries-old 
snow beneath your feet and only the occasional rumble of distant ice 
cliffs collapsing into the ocean, it is easy to imagine Antarctica — and 
those who spend time there — as isolated from the rest of the world 
and its life. 

Look in a different direction, however, and the illusion fades. Scott
knew this. “As one looks across the barren stretches of the pack, it is
sometimes difficult to realise what teeming life exists immediately
beneath its surface,” he wrote. “Beneath the placid ice floes and under
the calm water pools the old universal warfare is raging incessantly in
the struggle for existence.” There is life in Antarctica; you just need to
know where to look. 

For a sense of the teeming life that exists in the white continent, 
take a look at the Review article that we publish this week on page 431. 
Yes, there are only two species of flowering plants in Antarctica, but 
nobody said that the place had to look pretty. Think small. More than 
200 species of lichen and 100 species of moss cling to the coastline and 
to patches of exposed rock. The packed snow that rests on the Antarc- 
tic bedrock is riddled with water-filled holes, and these subglacial lakes 
support the highest diversity of viruses of any known aquatic system. 

Still not impressed? Head offshore — not too far — and the Antarc- 
tic waters thrive with so much life that they are considered to harbour 
more biodiversity than the celebrated Galapagos Islands. Through- 
out the Southern Ocean, researchers have discovered more than 
8,000 marine species, most of which scrape a living from the seabed. 
From such a rich base, a network of life spreads upwards and outwards
until it culminates in the region's iconic seals, penguins, whales and 
albatrosses. 

The point of this continental audit is threefold. Antarctica is alive 
— and therefore dynamic. Some change is bad and some good, but 
most points in the same direction: that Antarctica’s isolation from 
the rest of the world (biological, physical and cultural) is weakening.
And in a world that increasingly looks to value from the natural
environment, the Review article sums up the place nicely. “Antarctic
ecosystems provide several services essential to planetary stability.”
Awful or not, Antarctica is closer to home than many realize.
WORLD VIEW

Judge by actions, not words

Sexist comments made by my former boss Tim Hunt are not an indication that
he is biased against women, argues Alessia Errico.

From 2004 until December 2010, I worked as a scientist in the
London laboratory of Tim Hunt. Given the media frenzy over
his comments this month about women scientists, and the way
he has been portrayed as sexist, you might think that I had a miserable
time. Not at all; it was a great experience. He was an inspiring and
supportive mentor. Not once did I feel that he treated me differently
because I was a woman.

The anger that many people in research feel at Tim’s inappropriate
and indefensible comments (that women and men in the lab fall in
love and that women cry when criticized) is understandable. But
the reaction to them, and their impact on his career, has been
disproportionate. Tim has apologized for his remarks, and I, for one, accept
that apology. Others should do the same, and he should be allowed to
continue his work.

Tim says that his comments were intended as a joke. I believe him,
because I was once on the receiving end of one of his public jokes,
and his attempt at humour then was just as bad.

My early research was funded by a European consortium. Each year,
Tim and I had to attend the consortium conference, where the
group leaders would discuss the work. As we travelled to the first of
these conferences in April 2005, Tim realized that he did not have
a photo of me to include in his talk, which was that afternoon. He
always travelled with a camera in his shirt pocket, so he took a photo
then. During the talk, he introduced the picture to the audience with
the line: “There is Alessia, wearing exactly the same jumper.” Of
course, everyone in the audience searched for me among themselves
and, having verified that I was indeed wearing that jumper, had a
good laugh.

I was so embarrassed. There I was, at the first meeting to discuss
my research, and I had to start by explaining to new colleagues that,
yes, I did own more than one set of clothes. Still, I was not offended.
I knew he was joking, and it did provide a way of breaking the ice
when I met people later.

The fear of many scientists, and of those organizations that have
severed their ties with Tim, is that his judgement might be biased.
I know him well and can say that he is not sexist. I once attended a
‘Women in Science’ event where his wife Mary was one of the speakers
and where many of the issues that we have read about in the media
were discussed. Tim was also in the audience,
and I know that he engaged with the problem
of discrimination against women scientists and
supports efforts to tackle it.

NATURE.COM: Discuss this article online at go.nature.com/zsplfr

I have seen discrimination and sexism in
science and in wider society. I have seen female colleagues talked
about in negative ways when they left the lab to have children. The 
issue is a genuine one that demands urgent attention. But it is grossly 
unfair that Tim should be considered, and treated, as an emblem of 
this sexism or gender discrimination. 

I treasure all scientific discussions with Tim; he always had an 
interesting point of view and would constructively challenge us all. 
When I delivered my first seminar at our institute, Tim discussed 
the slides with me several times to ensure that the logic behind the 
project and each experiment was clear. I gave several talks while I 
was in the lab and his contribution to my preparation was always 
useful and positive. Although very busy, whenever he could he 
visited the lab, and would pop by to say hi and ask how things 
were going. 

He always supported my work, even when 
my research project took a different direction 
from the lab’s main expertise. For example, he 
arranged for me to visit a lab in Japan to learn 
a specific protocol that proved fundamental to 
my research. His passion for discoveries had 
no boundaries. It is important to remember 
that although we need role models and cham- 
pions for women in science, we also need truly 
inspirational figures for science. 

When my European grant ended, it was 
exclusively thanks to Tim that I got the extra 
funding guaranteeing me a good amount of 
time to continue my project after my mater- 
nity leave. At the time, we had several chats 
about my future. I had doubts about a career 
as a group leader, but Tim said I was good at 
science and technically very skilful, giving me the encouragement 
that I needed. 

After my career in research ended, I became an editor. I know the 
weight of words; however, facts and actions are much more impor- 
tant. A year after that first consortium conference, and wearing a 
different jumper, Tim and I again headed for the annual meeting. This
time, I had a different kind of shock. Tim suggested that I should be 
the speaker, not him, because it was my research. Well, it was quite 
stressful. The members of that consortium included some of the most 
important cell-cycle scientists in Europe, but it was a great oppor- 
tunity, and not only for me. Tim’s action led to other group leaders 
leaving the stage to their postdoctoral fellows. Some were men 
and some were women. I doubt Tim cared which; all he saw were 
promising young scientists.


Alessia Errico is a former editor for Nature Reviews Clinical 
Oncology. 
e-mail: alessia.errico@googlemail.com 


25 JUNE 2015 | VOL 522 | NATURE | 393 


© 2015 Macmillan Publishers Limited. All rights reserved 


RESEARCH HIGHLIGHTS

Selections from the scientific literature


Rising cost of 
climate inaction 


Increasing temperatures could 
boost electricity demand and 
costs over the next several 
decades, if warming continues 
unabated. 

James McFarland at the US 
Environmental Protection 
Agency in Washington DC 
and his team analysed rising 
temperature trends and climate 
policies from 2015 to 2050 
using three electricity-sector 
models for the United States. 
They found that electricity 
demand in 2050 would rise by 
1.6-6.5% more in scenarios 
with increasing temperatures 
than in the case of constant 
temperatures, owing to factors 
such as greater use of air 
conditioning. 

Without global action, the 
cost of electricity generation 
would grow by 1.7-8.3%, 
which is comparable to the 
projected costs of cutting 
carbon emissions from the 
power sector by 50% by 2050. 
Climatic Change http://doi.org/5gh (2015)


Colon-cancer cells made normal

By turning on a particular gene, researchers have made colon-cancer
cells in mice revert to normal ones.

Scott Lowe at the Memorial Sloan Kettering Cancer Center in New York
and his colleagues engineered mice so that they could use a small RNA
molecule to switch on the Apc gene, a tumour suppressor that is mutated
in most colorectal cancers. Turning on this gene shrank precancerous
polyps in the colon, and cells in the polyps developed and behaved
normally. Even reactivating Apc in full-blown tumours bearing other
cancer-driving mutations made the cells non-cancerous.

Silencing Apc in 3D cultures of mouse colon cells led to cancer-like
growth (pictured, left; scale bars are 50 µm), whereas switching it back
on restored normal cell division (pictured, right). Targeting the
cell-signalling pathway to which Apc belongs (called Wnt) could be a
therapeutic strategy, the authors say.

Cell 161, 1539–1552 (2015)


Elephant-poaching hotspots pinpointed

DNA analysis of seized ivory suggests that elephants have been poached
at high rates in just two regions in Africa.

Samuel Wasser at the University of Washington in Seattle and his team
studied genetic material from 28 large ivory seizures between 1996 and
2014 to identify the origins of poached tusks. They found that 86–93% of
savannah-elephant tusks seized by authorities since 2006 came from
southeastern Tanzania and neighbouring northern Mozambique. A similar
proportion of forest-elephant ivory originated from a region in central
Africa: the nexus of Gabon, the Republic of Congo and the Central
African Republic. The data could help authorities to focus regulatory
and law-enforcement efforts on these regions, the authors say.

Science http://doi.org/5h6 (2015)


Graphene protects cells for imaging

A single-atom-thick mesh of carbon can protect living animal cells from
being damaged under an electron microscope, and could lead to better
cell images.

Tissue samples are typically dried and chemically treated to protect
them from the vacuum of electron microscopes, but this kills cells and
can introduce structural artefacts. As an alternative, Ke Xu of the
University of California, Berkeley, and his colleagues used graphene,
which is conductive but impermeable to gas and liquid, to insulate
live, untreated mammalian cells against the microscope’s harsh
environment. The team obtained images that correlated well with other
live-cell imaging methods.

In another study, David Weitz of Harvard University in Cambridge,
Massachusetts, and his team used 3–10 layers of graphene sheets to
encapsulate liquid samples for transmission electron microscopy. They
imaged influenza viruses and cells in their native state.

Nature Commun. 6, 7384 (2015); Nano Lett. http://doi.org/5gp (2015)


CHEMISTRY 


X-rays make 
molecular movie 


Researchers have made a film of 
a molecule’s structural changes 
during a chemical reaction. 

A team led by Michael 
Minitti at the SLAC National 
Accelerator Laboratory in 
Menlo Park, California, used 
a powerful free-electron 
laser to fire ultrafast X-ray 
pulses at the ring structure 
of 1,3-cyclohexadiene as it 
opened up to form the linear 
molecule 1,3,5-hexatriene. 
The scattered X-rays provided 
structural snapshots roughly 
every 80 femtoseconds 
during the 200-femtosecond 
reaction (1 femtosecond 
is 10⁻¹⁵ seconds). Using
these results and theoretical 
calculations, the team worked 
out the most likely path for the 
reaction. 

The technique could reveal 
new reaction mechanisms, the 
authors say. 

Phys. Rev. Lett. http://dx.doi.org/10.1103/physrevlett.114.255501 (2015)


ANIMAL BEHAVIOUR 


Kangaroos are 
lefties 


Kangaroos that use two legs to 
jump have a strong preference 
for which hand they use to 
scratch themselves, suggesting 
that pronounced handedness 
is not uniquely human. 

Yegor Malashichev of Saint 
Petersburg State University, 
Russia, and his team 
observed seven 
species of marsupial 
(including 
kangaroos) in 
Australia — three 
that walk on four legs 
and four species that 
are bipedal. They found 
that two kangaroo species 


mostly used their left forelimbs 
for actions such as grooming 
(pictured is an Eastern 
grey kangaroo, Macropus 
(Macropus) giganteus). Red- 
necked wallabies (Macropus 
(Notamacropus) rufogriseus), 
which move on two legs, 
generally used their left 
forelimb for fine manipulation 
tasks, such as feeding, and 
the right forelimb for actions 
requiring greater physical 
strength, such as holding up 
branches when feeding. 
Four-legged animals did 
not exhibit handedness, 
suggesting that two-legged 
locomotion is a prerequisite 
for this characteristic. 
Curr. Biol. http://doi.org/5jh (2015) 


Polar winds blow 
on Titan 


Electrically charged particles 
stream away from Saturn's 
moon Titan, escaping into 
space in a similar way to Earth’s 
polar atmosphere. 

Titan's thick hydrocarbon 
haze is unique in the Solar 
System. Andrew Coates of 
University College London 
and his colleagues used the 
Cassini spacecraft to detect 
charged particles in Titan's 
uppermost atmosphere 
and deep space. Particles 
flowed away from the moon, 
along magnetic field lines 
that extended more than 
17,000 kilometres from Titan. 

The work explains how 
7 tonnes of atmospheric 
molecules manage to escape 
from Titan every day. 
Geophys. Res. Lett. http://doi.org/5jg (2015)


GEOPHYSICS 


An island’s ups and downs

An island off the coast of Chile lurched upward during earthquakes in
1835 and 2010, but subsided in between. Such events provide a rare look
at how Earth's crustal plates respond throughout an entire earthquake
cycle.

Just after the 1835 quake, Robert FitzRoy of the HMS Beagle noted that
seaweed and mussels had lifted as much as 3 metres above the shoreline
of Santa Maria island. Now, a team led by Robert Wesson of the US
Geological Survey in Denver, Colorado, has collected data from other
old surveys and modern satellite measurements of the same island.

The scientists found that the island’s height above sea level dropped
by about 1.4 metres between 1835 and 2010, before rising 1.8 metres
during the later quake. This suggests that strain builds up erratically
where the Nazca and South American tectonic plates collide, indicating
that the seismic activity in that region is more complicated than
scientists had thought.

Nature Geosci. http://dx.doi.org/10.1038/ngeo2468 (2015)


CLIMATE-CHANGE ECOLOGY


Plant diversity declines in the dry

The number of plant species in a California grassland area has dropped
since 2000 as the area has become more arid, an indication of how such
ecosystems might respond to climate change.

Susan Harrison and her colleagues at the University of California,
Davis, monitored plant diversity at 80 sites across a
27-square-kilometre area from 2000 to 2014. They found that the number
of species at the sites and across the study area decreased over time,
particularly in the number of native annual flowering plants. The
decline in diversity correlated with diminishing precipitation during
winter, when the annual plants are seedlings. The team did not find
links between the fall in diversity and other possible causes such as
fire, invasive species or nitrogen levels.

A reduction in precipitation is predicted for this area as the climate
warms, so such biodiversity losses could signify future extinctions.

Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1502074112 (2015)


SOCIAL SELECTION

Popular topics on social media


Curiosity over computer fact-checker

A computational fact-checker that can sort truth from fiction
caught the attention of the online science community this
week. Researchers at Indiana University in Bloomington
mined Wikipedia's information boxes, which summarize key
facts, to create a ‘knowledge graph’ of 3 million people, places
and things. The resulting algorithm then used that knowledge
to gauge the truth of statements that were presented to it,
such as “Rome is the capital of Italy”, with nearly the same
accuracy as humans. The researchers acknowledge that the
source material is not 100% reliable, as online commenters
also noted. William Gunn, head of academic outreach for the
research-management tool Mendeley in California, tweeted
that the results might be different if the algorithm had been
built using a different body of knowledge: “Wonder what
this would look like using sci pubs as a knowledge graph?”
Informatics researcher and lead author Giovanni Ciampaglia says
that the team plans to use other sources of information
to improve the fact-checker.

PLoS ONE 10, e0128193 (2015)
SEVEN DAYS


Japan whaling plan 


The scientific committee of 
the International Whaling 
Commission (IWC) has 
been unable to reach a
consensus on the merits of 

a proposed plan by Japan 

to resume hunting whales 
around Antarctica. Japan is 
seeking IWC permission for 
a new programme of whaling 
that it claims is for scientific 
purposes, but which critics 
claim is a commercial hunt in 
disguise. In a report published 
on 19 June, the committee 
said that it had not been 
given enough information 

to say whether killing whales 
was necessary to achieve 
Japan’s stated scientific 
objectives. A previous hunt 
was deemed unscientific by an 
international court last year. 


RESEARCH

Ebola trial ends

A clinical trial of an experimental Ebola drug being tested in people
in Sierra Leone has stopped enrolment because the drug does not seem to
be effective, the company leading the trial said on 19 June. The drug,
TKM-Ebola-Guinea, consists of RNA molecules designed to stop the virus
from replicating. Despite encouraging results in monkeys, the drug was
deemed unlikely to show benefits in humans, said Tekmira, its
manufacturer, based in Vancouver, Canada.


NUMBER CRUNCH

$95m

The amount that drug company GlaxoSmithKline will give over five years
to set up a non-profit institute in Seattle, Washington, to study
cell ‘operating systems’ and their role in health.


High seas to get legal protection

Biodiversity in ocean areas outside national jurisdictions (the ‘high
seas’) will receive protection under new laws, the United Nations
said on 19 June. A resolution adopted by the UN General Assembly states
that rules will be designed to ensure the “conservation and
sustainable use” of marine life in such regions, which are currently
often unmanaged. Amid increasing scrutiny of fishing and other
activities, a treaty that governs responsible use of the high seas has
long been a goal of conservation organizations and some governments.


Sandia head

Materials scientist and engineer Jill Hruby will become the first woman
to head any of the three US national-security laboratories when she
takes over the running of Sandia National Laboratories on 17 July. Her
appointment was announced on 22 June. Hruby has been at Sandia, the main
offices of which are in Albuquerque, New Mexico, and Livermore,
California, for more than three decades, and has had roles overseeing
nuclear, biological and chemical security. Like the Los Alamos and
Lawrence Livermore national laboratories, Sandia is trying to reinvent
itself by focusing on scientific research for today’s national
security needs.


French minister

Thierry Mandon was named French minister of state for higher education
and research on 17 June, after the post had lain vacant for three
months. He replaces Genevieve Fioraso, who stepped down in March for
health reasons. Mandon is a seasoned politician, but from 1998 to 2014
he served as president of Genopole, a cluster of biotechnology and
genetics companies and academic research labs in Evry, south of Paris.
French researchers have welcomed Mandon's appointment. See
go.nature.com/yizy3d for more.


FACILITIES 


Telescope restarts 
Construction of the Thirty 
Meter Telescope (TMT) on 
the summit of Mauna Kea in 
Hawaii is to resume on 24 June. 
Work has been halted since 
early April because of protests 
by Native Hawaiian groups, 
who see it as a violation of a
sacred mountain. “We are
now comfortable that we
can be better stewards and
better neighbors during our
temporary and limited use
of this precious land,” Henry
Yang, chair of the TMT
International Observatory
Board, said in a 20 June
statement. The controversy
has caused Hawaii's governor
to limit future astronomical
development on Mauna
Kea, which currently hosts
13 telescopes.


EVENTS


Imaging satellite 
Sentinel-2A, an Earth- 
observation satellite
that scientists say could
revolutionize monitoring of 
land-use changes, launched 
on 22 June from Europe's 
Spaceport in Kourou, French 
Guiana. The satellite has 
superior specifications to 
Landsat-8, the most advanced 
US Earth-observation 
satellite so far, and will be 
joined in orbit by its twin 
next year. Together they
will image the entire planet
every two to five days. The 
duo is part of a planned fleet 
of six families of Sentinel 
satellites that will form the 
core of the European Union’s 
Copernicus environmental 
monitoring system (see 
go.nature.com/evfinv), 
which is designed to provide 
near-real-time data on many 
environmental variables. 


Papal climate call
In a long-anticipated encyclical
letter (pictured), Pope Francis
warns that climate change
caused mainly by human
activity threatens to devastate
ecosystems and human
civilization. Released on
18 June, the nearly 200-page
letter contains a sweeping
account of scientific findings
and urges world leaders to
transition the global energy
system away from fossil fuels.
Scientists and policy-makers
hope that the encyclical will
help to strengthen public
trust in climate-change
science among the more than
1 billion Catholics worldwide,
and facilitate international
efforts to cut global emissions
of greenhouse gases. See
go.nature.com/r1i3q4 for more.


Thai MERS case


Thailand announced its first
case of Middle East respiratory
syndrome (MERS) on 18 June.
An elderly man, who travelled
from Oman to Thailand and
sought medical care, was
diagnosed with MERS at a
Thai hospital. The man and his
family were put into isolation,
and infection-control measures
implemented. Meanwhile,
the MERS outbreak in South
Korea seems to be slowing
down. As of 23 June, there
have been 175 confirmed
cases in South Korea and 1 in
China. The South Korean
Ministry of Health and Welfare
announced on 21 June that
seven experts from the US
Centers for Disease Control
and Prevention would be
visiting the country to provide
technical assistance.


End of Aquarius

A US-Argentinian research
satellite that carried
Aquarius, the first instrument
specialized to measure
ocean salinity from space,
has stopped working, NASA
announced on 17 June. A
piece of hardware aboard the
SAC-D observatory failed and
sent the satellite spinning out
of control earlier this month.
Although Aquarius had
finished its main three-year
mission, it had been expected
to last for at least another year.
Among other achievements,
Aquarius’s maps show how
fresh and salt water interact as
rivers pour into the ocean.


TREND WATCH


The global market for
photovoltaic solar energy will
rise rapidly, says the consultancy
GTM Research in Boston,
Massachusetts. It predicts that
by the end of 2015, the market
will have expanded by 36%
with 55 gigawatts (GW) of new
installed solar capacity; southeast
Asia will account for 30 GW of
this. Total solar photovoltaic
capacity will triple by 2020 to
nearly 700 GW. By then, demand
for solar panels will be largely
independent of government
incentives, says the analysis.


SOLAR CAPACITY SOARS
The world’s solar photovoltaic capacity is
predicted to triple by 2020.


SEVEN DAYS | THIS WEEK


28 JUNE–3 JULY
More than 650 young
researchers meet
65 Nobel laureates
for this year’s Lindau
Nobel Laureate
Meeting in Germany.
The scientists come
from all three natural-
science Nobel Prize
disciplines: medicine
and physiology, physics
and chemistry.
go.nature.com/tpithk


30 JUNE
The world’s first
Asteroid Day is held on
the anniversary of the
1908 Tunguska event
in Siberia, the largest
impact on Earth in recent
history. The event is
intended to raise global
awareness about the risk
from asteroids and how
to protect Earth.
go.nature.com/uvhz7w


BUSINESS
Hepatitis C drug


China’s patent office has rejected
attempts to patent a blockbuster
hepatitis C drug, said charity
Médecins Sans Frontières
of Geneva, Switzerland, on
19 June. The antiviral drug
sofosbuvir is made by Gilead
Sciences of Foster City,
California, but health advocates
have argued that it is too
expensive for people in middle-
and low-income countries. It
costs US$84,000 per course,
although Gilead has negotiated
a discounted price of $900 with
countries including Egypt.
China’s decision — which has
not been announced by the
government — could spur
other countries to challenge
the patent, leading to the
manufacture of cheaper generic
versions.


Vaccine purchase 
Pharmaceutical giant 
GlaxoSmithKline (GSK)
of Brentford, UK, has sold
two meningitis vaccines to 
Pfizer in New York City for 
£82 million (US$130 million). 
Announced on 22 June, the 
deal will give Pfizer Nimenrix 
and Mencevax, which protect 
against 4 strains of meningitis. 
The sale was deemed necessary 
by regulators after GSK’s 
purchase earlier this year of a
vaccines business owned by 
Novartis of Basel, Switzerland; 
that transaction included two 
other meningitis vaccines. 


NATURE.COM
For daily news updates see:
www.nature.com/news


Urban microbes unveiled


Genomics tracks the microscopic inhabitants of homes, transport systems and sewers. 


BY RACHEL EHRENBERG 


Deep in the filth and chaos of the
world’s great metropolises, amid the
people, pigeons, cockroaches and rats,
there is a teeming world of bacteria, viruses,
fungi and protists that scientists are only now
surveying. Microbes are everywhere: on trains,
pavements and lifts; in parks, libraries, hospitals
and schools. Most are innocuous, some
are friendly, and a handful cause death and
disease. But the vast majority are unknown.
Researchers described results of early forays
into this terra incognita at the Microbes in the
City conference on 19 June, hosted by the
New York Academy of Sciences and New York
University (NYU) on the 40th floor of an
antiseptic-looking glass office tower in Manhattan.
“We're really at the infancy of a very interesting
scientific endeavour,” said Joel Ackelsberg, a
medical epidemiologist for the New York City
Department of Health and Mental Hygiene.
“Right now, we know very little.”

Researchers are not even sure how to survey 
this strange landscape. There are competing 
techniques for detecting, quantifying and 
keeping track of which microbes are doing 
what in the built environment, and where. But 
researchers believe that efforts could lead to 
new approaches for monitoring bioterrorism, 
tracking disease outbreaks or assessing the 
impact of storms and pollution. 

Each month, high-throughput techniques 
allow scientists to sequence roughly 1,000 
microbial genomes from samples collected 
in various environments, said computational 
biologist Curtis Huttenhower of the Harvard 
T. H. Chan School of Public Health in Boston, 
Massachusetts. That is an impressive amount
of data, but it is dwarfed by the unfamiliar. 
Christopher Mason, a computational geneti- 
cist at Weill Cornell Medical College in New 
York City, told the conference how a baseline 
survey of genetic material from surfaces in the 
city’s subway system had uncovered DNA from 
almost 1,700 known taxa, mostly harmless bac- 
teria. But 48% of the genetic material did not 
match anything yet identified. “Half the world 
under our fingertips is unknown,” said Mason.
Still, trends are emerging from the global 
Metagenomics and Metadesign of Subways 
and Urban Biomes initiative (MetaSUB), which 
aims to characterize the genetic material found 
on public-transport systems in 16 world cities to 
elucidate the microscopic riders that share the 
commute. Storms leave a mark: months after 
New York City’s South Ferry Station was 
flooded in 2012’s Hurricane Sandy, it still
harboured DNA from bacteria associated 
with cold marine environments and fish, 
Mason said. However, most of the bacteria in
the subway were harmless Acinetobacter spe- 
cies and others associated with human skin. 

In his talk, Huttenhower described a 
survey of Boston's transit system that yielded 
similar flora. “Everything is covered in skin,” 
he said. He noted that metal poles on the 
trains, which riders commonly consider 
hygienically suspect, actually retain much 
less bacterial biomass than the system’s 
upholstered seats or plastic hand grips. 

Microbiomes in houses tend to match 
those of the homes’ human inhabitants — 
and quickly morph after a change in occu- 
pancy, said environmental microbiologist 
Jack Gilbert of Argonne National Laboratory 
in Illinois. He and his colleagues described 
results from a survey of ten homes, which 
found that they become populated with new 
residents’ microbes within 24 hours. 

Rodents are under study, too. White- 
footed mice (Peromyscus leucopus) in New 
York City carry more Helicobacter and 
Atopobium bacteria — associated with 
stomach ulcers and bacterial vaginosis 
in humans — than their suburban coun- 
terparts, but are totally free of tick-borne 
pathogens, reported biologist Alyssa 
Ammazzalorso of the Albert Einstein Col- 
lege of Medicine in New York City. The 
city’s rats carry a number of bacteria known 
to cause problems in people, said epidemi- 
ologist Ian Lipkin, director of the Center 
for Infection and Immunity at Columbia 
University in New York. He and others have 
found pathogenic Escherichia coli, Clostrid- 
ium difficile, Salmonella enterica and the 
Seoul strain of hantavirus, which can be fatal 
when transmitted to humans (C. Firth et al. 
mBio 5, e01933-14; 2014). 

Sewage samples from New York City’s 
14 wastewater-treatment plants turned up 
a disturbing number of genes for resistance 
to antibiotics, reported genomicists Susan 
Joseph and Jane Carlton of NYU’s Center 
for Genomics and Systems Biology. As a rich 
human-derived soup spiked with antibiot- 
ics, sewage provides an ideal niche for the 
growth and spread of resistance, Joseph said. 
Martin Blaser, director of the Human Micro- 
biome Program at the NYU School of Med- 
icine, said that as populations of resistant 
microbes and their defensive tools become 
more diverse, the diversity of human-associ- 
ated microbes in general is declining. He told 
how he and his colleagues have found that 
people in the West carry fewer protective 
bacteria than isolated human groups such 
as the Yanomami of the Amazon rainforest. 

“We may have lost as much as half of our 
diversity already,” said Blaser, “just as we 
are beginning to realize how important it 
might be.”


The fungus that causes white-nose syndrome eats into bats’ wings.


Bacteria may help 
bats to fight fungus 


As white-nose syndrome spreads, researchers are trialling 
ways to stop colonies from collapsing. 


BY NALA ROGERS 


The bats at Marm Kilpatrick’s two Illinois
field sites perished right on schedule.
The mines sheltered nearly 30,000 bats
before white-nose syndrome, a deadly fungal
disease, arrived in late 2012. By March 2015,
less than 5% remained.

Kilpatrick, a disease ecologist at the 
University of California, Santa Cruz (UCSC), 
and his colleagues chose the mines because 
they lay right in the path of the fungus, which 
has spread from Europe through 26 US states 
and 5 Canadian provinces since January 2007. 

Although researchers are currently helpless 
to halt the spread of the fungus, there is reason 
for cautious optimism: treatments could soon 
be available that will help the bats to keep the 
infection at bay, for a season at least. 

The goal, says Chris Cornelison, a micro- 
biologist at Georgia State University in 
Atlanta, is to ensure that when researchers 
find long-term solutions for the disease, “there 
are still bats to treat”. 

The fungus (Pseudogymnoascus destructans) 
grows on bats while they hibernate in winter, 
digging into their noses, ears and wings. Ani- 
mals that survive until spring usually clear the 
infection as their bodies warm; some species 
do it year after year. But the pathogen causes 
other species to repeatedly rouse from hiber- 
nation, which burns up fat reserves and can 
cause the animals to starve to death. Some even
flee their roosts in a futile search for food. 

“They come out of caves in the winter, and 
they try to get into people's homes or churches 
or schools,” says Jeremy Coleman, national 
coordinator for white-nose-syndrome research 
at the US Fish and Wildlife Service in Hadley, 
Massachusetts. “They're dead and dying on the 
ground.” 

Kilpatrick and his colleagues have discovered 
that a bacterium found on bats’ wings may help 
them to combat infection. In April, the scien- 
tists published a paper in PLoS ONE showing 
that two strains of the bacterium Pseudomonas 
fluorescens kill white-nose fungus in cell culture 
(J. R. Hoyt et al. PLoS ONE 10, e0121329; 2015).
Last winter, the researchers applied the treat- 
ment to bats in the lab. They have not released 
their results, but Kilpatrick hopes to test the 
bacteria in the wild soon. 

Others are examining whether volatile 
organic compounds produced by Rhodoc- 
occus bacteria, which are found in soils, can 
kill white-nose spores. During a field trial in 
Missouri last winter, Cornelison and his col- 
leagues treated bats with such compounds for 
48 hours before returning the caged animals to 
their cave for 4 months to finish hibernating. 
On 19 May, the researchers released the bats 
that were free of disease — prompting media 
coverage of a potential ‘cure’.

Cornelison cautions against such 
celebration. His team is still analysing the
data from the trial, and has not revealed 
how many bats were treated and released 
or how the controls fared. And even if this 
or the bacterial treatments are effective, 
they will be only short-term solutions. The 
fungus lingers on cave walls during the 
summer, and bats do not seem to develop 
immunity to it — so researchers would 
need to treat the animals every year to keep 
them from getting sick. 


STOPGAP SOLUTIONS 

Long-lasting solutions remain elusive. 
Some scientists hope to develop a vaccine, 
but have yet to work out how to trigger the 
animals’ immune systems to destroy the 
pathogen, says Ken Field, an ecoimmunol- 
ogist at Bucknell University in Lewisburg, 
Pennsylvania. Bats naturally produce anti- 
bodies to the fungus, but there is no evi- 
dence that these can help them to survive. 

Other researchers are promoting a more 
radical long-term solution: altering airflow 
in mines where bats hibernate to make the 
sites less hospitable to the white-nose fun- 
gus. Places where bats survive infection 
tend to be relatively cool and dry. By open- 
ing new routes to the outside, researchers 
could cool and dehumidify the air in mines 
that are too warm and wet. 

There is a chance that manipulating 
airflow could drive bats to abandon the 
habitat, says Kate Langwig, an ecologist at 
the UCSC. But she argues that it is worth 
a try, because at some sites the white-nose 
fungus kills about 90% of the bats present. 
In the first 5 years that it was present in the 
United States, the pathogen claimed more 
than 5.5 million animals. 

In the meantime, the United States and 
Canada are developing and implementing 
strategies to coordinate work by scientists 
and by local and national governments 
— ranging from laboratory and field studies 
to efforts to prevent people from inadvert- 
ently spreading the fungus to pristine caves. 

The plight of the bats is “stark — it’s 
demoralizing”, says Winifred Frick, an 
ecologist at the UCSC. “But I have hope in
terms of the amount of creative energy and 
sense of dedication that people are putting 
forth on this problem. If there are solutions, 
we will find them.”


First glimpse of
primordial stars 


Astronomers claim to spot generation that seeded Universe. 


BY ELIZABETH GIBNEY 


Some of the first generation of stars, whose
explosions breathed carbon, oxygen and
other elements into the Universe, may
have been glimpsed for the first time. The
possibility comes as a pleasant surprise to
astronomers, who did not expect to be able
to spot these primordial objects with existing
telescopes.

Primordial stars are theorized to be hundreds
of times larger than the Sun, and made
up only of pristine hydrogen, helium and
traces of lithium left over from the Big Bang.
The earliest specimens formed during the first
few hundred million years of the Universe, living
for only a few million years before exploding
in supernovae that laid the seeds for the
more element-rich stars to come. But they have
never been seen.

A team led by David Sobral, an astronomer
at the University of Lisbon, now reports that
it may have spied a late-blooming cluster of
such stars, in the brightest distant galaxy yet
observed. The stars, seen as they were when
the Universe was around 800 million years old
(a mere 6% of its current age), seem to be primordial
in composition — but, strangely, they
also seem to reside in the same galaxy as some
second-generation stars.

“Until now, work on these stars has been
completely theoretical,” says Sobral. “For
the first time, we're starting to get observations
that can test the many theories about
these stars and begin to understand how they
formed.” His team’s report has been posted
on the preprint server arXiv and accepted
for publication in The Astrophysical Journal
(D. Sobral et al. Preprint at
http://arxiv.org/abs/1504.01734; 2015).

Seeing these ancient stars involves observing
very distant galaxies. Their light takes billions
of years to reach us, and so shows the Universe
in its earliest days. But the light is faint, making
it difficult to spot. The short lifetimes of the
first stars also make them hard to find.


Artist’s impression of galaxy CR7.

The surprise discovery emerged after 
Sobral and an international team of astrono- 
mers made a wide sweep of the sky using the 
Subaru Telescope on Mauna Kea in Hawaii. 
They used three further telescopes to peer 
into particularly bright galaxies, and found 
an intriguing signal from one that they named 
COSMOS Redshift 7. (The name was chosen 
so it could be abbreviated as CR7, to echo the 
nickname of Portuguese footballer Cristiano 
Ronaldo.) 

The spectrum of light from CR7 showed 
evidence of ionized helium, which suggested 
that the source of the light was extremely hot. 
At such temperatures, any carbon and oxygen 
present should also have ionized, says Sobral. 
But there was no sign of these elements in
the light, which strongly suggested that it
came from the first generation of stars. 

CR7 also hosts second-generation stars, 
made from material spat out by dying first- 
generation stars, which means that it is not the 
sort of galaxy in which astronomers had imag- 
ined that they would make such a discovery. 
Sobral and his colleagues suggest that the pri- 
mordial stars may be late developers, formed 
from a cloud of pristine and uncontaminated 
gas that was prevented from cooling and coa- 
lescing by the heat of strong radiation from 
earlier-blooming stars. This later formation
date would also explain why the stars were
visible to Subaru. 

That primordial stars should turn up in a 
large and evolved galaxy presents a challenge 
to the group’s interpretation of CR7’s light 
signal, but is probably the least exotic of the 
possible explanations, says Naoki Yoshida, an 
astrophysicist at the University of Tokyo. One 
alternative, that the emission comes from a 
supermassive black hole that formed directly 
from the collapse of a pristine cloud of gas, 
would actually be “even more spectacular”,
adds Sobral.


Until now, many astronomers believed that 
seeing the first generation of stars would take 
an instrument such as NASA’s James Webb 
Space Telescope, which is projected to cost 
nearly US$9 billion, is scheduled to launch 
in 2018 and should have the power to look 
further back in time than any previous instru- 
ment. But if Sobral and his colleagues are right, 
it may be possible to see primordial stars with 
existing telescopes. Indeed, similar bright gal- 
axies have already been identified as potential 
candidates, adds Sobral. “We may already be 
looking at them.”


An object estimated at 20 metres across exploded over Chelyabinsk, Russia, in 2013. 


PLANETARY SCIENCE 


Private asteroid 
hunt lacks cash 


Foundation fails to raise funds it needs for a space telescope 
to catalogue near-Earth objects. 


BY TRACI WATSON 


Astronomy and science organizations
have declared 30 June as Asteroid
Day, with plans to talk up the danger
of asteroids that might be on a collision course
with Earth. One partner in the effort, the 
Sentinel mission, has an especially urgent need 
to drum up public support: it is struggling to 
raise the US$450 million it needs to launch a 
space telescope dedicated to finding hundreds 
of thousands of near-Earth objects. 
Astronomers have long sought a space- 
craft that would hunt for near-Earth objects 
full-time, but public funding for such a
programme has never materialized. Ground- 
based surveys have identified nearly all of the 
largest asteroids, but many of the space rocks 
that measure between 50 and a few hundred 
metres in diameter — big enough to wipe out 
a city with a direct hit — remain uncharted. 

“Without space-based assets, progress is 
going to stall,” says physicist Mark Boslough of
Sandia National Laboratories in Albuquerque, 
New Mexico. 

Sentinel, announced in 2012 by the 
B612 Foundation of Mill Valley, California, 
was supposed to provide that eye in the sky, 
and NASA had hoped that the privately run
effort would supplement the agency’s own 
asteroid-hunting programme. NASA is con- 
sidering funding an alternative mission, the 
Near-Earth Object Camera (NEOCam), 
initially proposed to the agency in 2006. But 
some scientists fear that Sentinel could spoil 
the chances of NASA supporting NEOCam. If 
Sentinel cannot fly, “then it’s just a distraction 
from getting a job done that needs to get done,” 
says planetary scientist Timothy Swindle of the 
University of Arizona in Tucson. 

Ground-based surveys have located roughly 
90% of the large asteroids that might be a con- 
cern — these are all bigger than one kilometre in 
diameter and pass by Earth’s orbit at a distance
of 45 million kilometres or less. An impact by 
one of these behemoths would have global con- 
sequences; fortunately, such immense objects 
plummet to Earth on average only once every 
700,000 years or so, and none of those known is 
heading this way any time soon. 

The statistics are less reassuring for smaller 
objects. As of August 2014, according to 
data in a recent paper, surveys had spotted 
only 565 near-Earth asteroids ranging from 
roughly 45 to 55 metres across — out of an 
estimated total of up to 520,000 (A. W. Harris 
and G. D’Abramo Icarus 257, 302-312; 2015). 
An asteroid no larger than that damaged trees 
across 2,000 square kilometres of Siberian forest 
in 1908, in what is known as the Tunguska event 
(see ‘Direct hit’). Tunguska-sized bodies strike 
Earth once every 500 years or so, Boslough 
says. An object that exploded over Chelyabinsk, 
Russia, in February 2013 was estimated to have 
been about 20 metres in diameter. 


PLANETARY PROTECTION 

Sentinel aims to ensure that a similar fate does 
not unexpectedly befall a place the size of Lagos 
or London. Mission director Harold Reitsema, 
who helped to design instruments on the Hub- 
ble Space Telescope, says that Sentinel will use 
an infrared detector to find 90% of near-Earth 
objects larger than 140 metres across, and will 
also spot a good number of objects measuring 
30 metres or more. If an asteroid were found to 
be headed for Earth long enough before impact, 
authorities could try to knock it off course. 
Or they could plan to evacuate the target area.

To help Sentinel along, in 2012 NASA signed
an agreement with the B612 Foundation to
provide analytical support and, once the spacecraft
is in orbit, a data downlink. The space
agency has an interest in Sentinel’s success;
it is under orders from Congress to find 90%
of all near-Earth objects 140 metres or bigger
by 2020. NASA will miss that target whether
Sentinel launches or not, but a good space
telescope could help to build a catalogue more
quickly than ground-based surveys alone,
which would need perhaps several decades.

“I don't want anybody to think that B612
is going to save the planet, but they are doing
what we need to do,” NASA administrator
Charles Bolden said in 2013 when Congress
pressed him on asteroid-detection efforts.

Yet progress has been slow. The B612 Foundation
raised donations of roughly $1.2 million
in 2012 and $1.6 million in 2013 — far short
of its annual goal of $30 million to $40 million.
NASA says that Sentinel has also missed
every development milestone laid out in the
2012 agreement. In a January statement to an
advisory panel, NASA said that its “reliance
on the private sector for a space-based NEO
survey ... is being re-examined”. NASA's
Lindley Johnson, director of the near-Earth
object programme, declined to speak to
Nature, citing the ongoing discussions between
the B612 Foundation and the agency.


DIRECT HIT


In the 1908 Tunguska event, an object exploded over Siberia,
damaging trees across 2,000 square kilometres. If a similar event
occurred above New York City, the damage would hit all five boroughs
and beyond. Impact energies for different sized objects are shown.
SOURCES: G. LONGO ET AL., 2005 (TREEFALL); NRC, 2010 (IMPACT ENERGIES)

NEOCam, meanwhile, would use an infra- 
red telescope to search for asteroids from a 
vantage point between Earth and the Sun. In 
September, NASA will decide whether it is a 
finalist out of more than two dozen proposals 
being considered for launch by 2022 through 
the Discovery programme, which caps each
mission’s cost at $450 million.

If Sentinel receives substantial funding 
soon, it could launch by late 2019, says B612 
Foundation chief executive and former astro- 
naut Edward Lu. Even if NASA terminates its 
agreement with the foundation, he vows to 
keep the project going. “Believe me, I could do 
a lot of other things,” he says. “But I feel this is 
extremely important.”


Earth science wrestles with 
conflict-of-interest policies 


Industry-funding controversies highlight lack of standards among field’s journals. 


BY JEFF TOLLEFSON 


The undisclosed industry ties of some
authors of Earth-science papers have
raised ethical questions about how the
field handles conflicts of interest. The cases of
global-warming sceptic Willie Soon and hydrol- 
ogist Donald Siegel have inspired calls for a uni- 
form policy on reporting funding from entities 
that have an interest in the outcome of research. 

“The Earth-science community doesn’t really
have a coherent set of policies for dealing with 
this,” says Naomi Oreskes, a science historian at 
Harvard University in Cambridge, Massachusetts,
who co-authored a 12 June commentary¹
in Environmental Science & Technology calling
for stronger disclosure rules. 

Soon, a solar physicist at the Harvard-Smithsonian
Center for Astrophysics, co-authored a
2010 paper² on climate-change policy in Ecology
Law Currents, but he did not disclose funding
from Southern Company, an electricity
provider in Atlanta, Georgia. The company
has lobbied against stronger regulations to 
limit greenhouse-gas emissions. 

The journal's editors told the Climate Inves- 
tigations Center (CIC), a watchdog group in 
Alexandria, Virginia, that they do not have a
conflict-of-interest policy, but are “exploring 
the possibility”. 

Soon also published a study³ in March in
Nature Geoscience on atmospheric conditions 
during the Little Ice Age — a 500-year cool 
period that began around the 1400s — with- 
out reporting his funding. Nature Publishing 
Group (which also publishes Nature) says that 
Soon complied with its policies, which require 
the disclosure of financial ties that are relevant 
to the research in question. 

But even if the study had little relevance to 
climate policy, Oreskes says Soon's funders ben- 
efit from any paper in a peer-reviewed journal. 
Such funding relationships can create an uncon- 
scious bias and should be reported, she says. 
Still, having disclosure policies does not
guarantee that scientists will abide by them, 
as the CIC found when it examined other 
publications that Soon listed as “deliverables” 
in reports to Southern Company. 

In 2009, Soon and others published a study⁴
in the Journal of Climate on the variability of 
monsoons. The journal requires authors to 
disclose all funding sources and “any finan- 
cial arrangement with a research sponsor that 
could give the appearance of a conflict of interest.”
Soon’s co-authors acknowledged support
from conventional granting agencies. But Soon 
did not report his funding from Southern Com- 
pany. After the CIC released documents in Feb- 
ruary revealing Soon’s industry ties, the journal 
amended the study to clarify his funding. 

In another case, hydrologist Donald Siegel 
of Syracuse University in New York came 
under fire for a study⁵ that concluded that
fracking — which uses pressurized fluids to
shatter rock and release trapped natural gas
— had not contaminated groundwater in
Pennsylvania. Siegel did not disclose that the 
Chesapeake Energy Corporation of Okla- 
homa City, Oklahoma, had paid him and 
provided the water samples that his team 
analysed. Environmental Science & Technol- 
ogy, which published the analysis, posted a 
correction in April after the media revealed 
Siegel's links to the company. 

Siegel says that he thought that this 
relationship was obvious, because he was 
working on a summer contract with the 
company’s consultants, some of whom were 
listed as co-authors. “I never really
anticipated this, but perhaps I was naive,” Siegel
says. “Because of the public nature of some 
of this science, I think we probably need a 
much tighter rein on what disclosure is.” 

Disclosure demands are coming from 
both sides of the aisle: climate sceptics have 
objected to a Nature Climate Change study
that analysed some of the US Environ- 
mental Protection Agency’s greenhouse- 
gas regulations, because it was written by 
researchers who have received grants from 
the agency — even though the grants are 
listed in a public database. 

Some experts suggest that Earth scien- 
tists should look to the biomedical com- 
munity for guidance. Many biomedical 
journals require authors to fill out a com- 
mon disclosure form that publishers devel- 
oped in 1978 and have continued to update. 
And universities and hospitals often require 
medical researchers to report each year on 
their financial arrangements with industry. 

Eric Campbell, a sociologist at Harvard's 
Edmond J. Safra Center for Ethics, says that 
biomedical scientists, funding agencies and 
journals have better-established disclosure 
policies because their work often involves 
human participants, and because of the 
strong financial ties between academia and 
the pharmaceutical industry. 

There are no plans yet for the Earth- 
science community to develop disclosure 
standards. But there may be nascent backing 
for such an effort. Véronique Kiermer, direc- 
tor of author and reviewer services at Nature 
Publishing Group, says that the publisher 
“would be supportive of discussions in the 
Earth-sciences community about the spe- 
cific challenges of the field and a framework 
for the standards of transparency it requires”.

Campbell says that such a group discussion
would be useful. “You don’t want the
individual with the conflict making decisions
about what they should do about it.”

A plastic cast of the skull of Kennewick Man.


Genome results 
rekindle legal row 


‘Kennewick Man’ sequencing shows Native American roots. 


BY EWEN CALLAWAY 


The genome of a famous 8,500-year-old
North American skeleton, known as
Kennewick Man, shows that he is closely
related to Native American tribes that have for 
decades been seeking to bury his bones. The 
finding seems likely to rekindle a legal dispute 
between the tribes and the researchers who 
want to keep studying the skeleton. Yet it comes 
at a time when many scientists — including 
those studying Kennewick Man — are trying to 
move past such controversies by inviting Native 
Americans to take part in their research. 

“The controversy has been painful for lots of 
people; tribal members and scientists as well,” 
says Dennis O'Rourke, a biological anthropolo- 
gist at the University of Utah in Salt Lake City. 

Soon after the skeleton’s bones were 
unearthed in 1996, near the shores of the 
Columbia River near Kennewick, Washington, 
several local tribes demanded the return of Ken- 
newick Man, whom they dubbed the Ancient 
One. The US Army Corps of Engineers — the 
federal agency that manages the land where the 
remains were found — sided with the tribes, 
citing a 1990 law that mandates the return of
Native American remains and artefacts to affili- 
ated tribes: the Native American Graves Protec- 
tion and Repatriation Act (NAGPRA). 

Several archaeologists and anthropologists 
sued the US government to stop the return, 
arguing that Kennewick Man was too old to be 
connected to the tribes. In 2002, a federal judge 
ruled that NAGPRA did not apply because the 
US government had not established that the
tribes had a cultural affiliation with Kenne- 
wick Man. The decision was affirmed by a US 
Appeals Court in 2004. 

A team led by Eske Willerslev, a palaeo- 
genomicist at the Natural History Museum of 
Denmark in Copenhagen, used cutting-edge 
methods to extract snippets of DNA from a 
flake of finger bone and then sequence the 
genome. The researchers then compared 
the genome to that of members of dozens of 
groups from across North and South Amer- 
ica, including several members of the Con- 
federated Tribes of the Colville Reservation, 
one of the five groups seeking Kennewick 
Man’s reburial (M. Rasmussen et al. Nature 
http://doi.org/5jb; 2015). Members of the 
other four groups (the Nez Perce, the Umatilla,
the Yakama and the Wanapum) opted
not to contribute DNA to the study. 

The Colville members were more closely
related to Kennewick Man than were many North and South
American tribes, but some other tribes still 
shared close ancestry with the 8,500-year-old. 
“We can't say that the Colville are the closest 
living descendants of Kennewick Man, because 
the reference panel is too small,” warns Willerslev.
“But I think we can say that Colville is very
closely related to Kennewick Man.”

Gail Celmer, an archaeologist at the US 
Army Corps of Engineers in Portland, Ore- 
gon, says that the Colville people are still eager 
to pursue repatriation. Her agency therefore 
plans to reconsider whether Kennewick Man 
falls under NAGPRA. “We expect challenges, 
so we’re going to have to be very careful about
how we do our reviews,” she says.

Hank Greely, a legal scholar at Stanford 
University in California, notes that with a 
genome sequence analysed, “it’s in the best 
interest of the scientists, of the government, 
and of Native Americans to think seriously 
about giving Kennewick Man’s remains back 
to the tribes”. 

“The whole point of the Kennewick Man 
case was to ensure that important discoveries 
like this had an opportunity to be made,” says
Doug Owsley, a forensic anthropologist at the
Smithsonian National Museum of Natural History
in Washington DC and one of the scientists
who sued the US government to gain access to
the remains. “If I had to do it again, I would.”

James Chatters, an anthropologist who exca- 
vated Kennewick Man and who has since set 
up the consulting firm Applied Paleoscience in 
Bothell, Washington, is torn. “As a person who 
worked directly with the skeletal remains, I'd 
like to see them in peace,” he says. “As a scientist,
I would hate to see one of the most complete
North American skeletal remains be put back
into the ground for political reasons.”


CLOSE TIES 
Willerslev is sticking to the sidelines on the issue 
of repatriation. “It’s somebody else who needs 
to figure that one out,” he says. But he wants 
the tribes to be involved in his research; after 
his team determined that Kennewick Man was 
Native American, he informed the five tribes 
about the conclusion. Some of the members 
travelled to his Copenhagen lab to learn more 
about the research, donning full-body suits 
to visit the clean lab in which ancient DNA is 
extracted from remains. The lab did similar out- 
reach with tribes in Montana after it sequenced 
the genome of the ‘Anzick boy’ (M. Rasmussen
et al. Nature 506, 225–229; 2014), helping to
broker a deal to rebury those bones. 

Many other researchers are taking a similar 
approach. O’Rourke says that there is no
one-size-fits-all strategy to working with native
communities. He finds some of the North American
Arctic groups he works with eager to contribute 
to his research; others are less so, and their opin- 
ions shift over time. 

“We really have to change the top-down 
approach, where we come to people and say 
‘these are our research questions and you should 
participate, because — SCIENCE’,” says Jennifer
Raff, an anthropological geneticist at the Uni- 
versity of Texas at Austin. 

Just weeks before Kennewick Man’s remains 
were discovered, researchers working in Alaska 
discovered a 10,000-year-old human skeleton. 
They notified local tribes and quickly came to 
an agreement that allowed them to excavate and 
study the remains and keep the tribes involved 
in the research. “You don’t really hear so much
about the good cases,” says Raff.


See go.nature.com/cnizsi for a longer version. 


CORRECTION 

The News story ‘US “export rules”
threaten research’ (Nature 522, 266–267;
2015) should have said that information 
developed through fundamental research 
— rather than all unclassified information — 
is considered to be in the public domain. 

When the Francis Crick Institute opens in London this year, it will be Europe’s largest
biomedical research centre. Can director Paul Nurse make this gamble pay off for UK science? 


BY EWEN CALLAWAY 


Fifteen years ago, Paul Nurse had what he now calls “a really stupid
idea”. The Nobel-prizewinning geneticist proposed bringing two
of London's most esteemed biomedical-research centres under
one big tent. Literally. He suggested building a megalab inside the
Millennium Dome, a 1-kilometre-round, 52-metre-high open-plan
exhibition centre with a plastic and fibreglass roof on the Greenwich
Peninsula in southeast London. “It was crazy, because it was not built
as a research institute,” says Nurse. “It got nowhere.”

Actually, Nurse's idea did go somewhere: about 10 kilometres northwest,
right in the heart of London. In November, he will open the doors to
the Francis Crick Institute, a lab even more ambitious than he originally
conceived. When the £700-million (US$1.1-billion), 93,000-square-metre
institute reaches full capacity by 2021, it will house some 1,600 scientists
and support staff, making it Europe's single largest biomedical lab.

Vast expectations come with that size. Nurse and the UK government
have advertised the Crick as a boon to British science, one that
will generate groundbreaking discoveries, lure the world’s most brilliant
young researchers to the United Kingdom and boost the British
life-sciences industry. And it will do so while eschewing departments,
permanent positions and any sort of scientific focus. “It is quite a different
sort of beast,” says Nurse. “I sometimes think not everybody realizes
what's been put together here.”


If it works, this eccentric experiment in scientific infrastructure could
guide how other countries form their labs of the future. But some question
whether the Crick will have been worth the risk. They warn that
the centre could become a black hole that devours public and private 
research funding, while widening the gap between the haves and have- 
nots of UK science. “It is a big, big, big, enormous whale of an investment,” 
says James Wilsdon, a science-policy researcher at the University of Sussex 
in Brighton, UK. “It has to succeed. If it doesn’t, what do we do?” 


BIG AMBITIONS 

The Crick was born of necessity. By the late 1990s, two of the United 
Kingdom's most esteemed biomedical-research institutions needed to 
find new homes. The pre-Second World War building that housed the 
National Institute for Medical Research (NIMR), in Mill Hill, north 
London, was showing its age. So were some lab facilities supported by
the Imperial Cancer Research Fund, a charity later renamed Cancer 
Research UK (CRUK). Nurse was head of that charity when he proposed 
the Millennium Dome megalab. But rather than pursue that dream, he 
left England in 2003 to lead the Rockefeller University in New York City. 

Nurse's crazy idea lingered nonetheless. An influential 2006 govern- 
ment review, the Cooksey report, concluded that the country was “at 
risk of failing to reap the full economic, health and social benefits 
that the UK’s public investment in health research should generate”,
and called for increased collaboration between funders and greater 
innovation in biomedical research. So when then-NIMR head Keith 
Peters revived Nurse’s idea of merging the two institutions, it gained 
traction. Peters settled on a plot of land just north of central London, 
between the British Library and St Pancras railway station. To pay 
for the new centre, he brought in nearby University College London 
and the Wellcome Trust biomedical charity; these were later joined 
by King’s College London and Imperial College London, which con- 
tributed £40 million each to construction costs. The universities will 
also house researchers at the Crick, especially physical scientists and 
engineers, and give the institute access to their associated hospitals. 

The Crick, which is named after the co-discoverer of the structure 
of DNA, is billed as a basic-research institute. However, its funders are 
eager to see its discoveries turn into treatments. Researchers study- 
ing everything from stem-cell development to influenza will have 
access to drug-screening robots, high-powered microscopes and up to 
200,000 animals, mostly mice and zebrafish. 

The institute will evolve in its early years. The 90 or so group lead- 
ers currently at the NIMR and CRUK’s London Research Institute 
(LRI) will be its first occupants when they start moving in early next 
year. The rest of the roughly 120 group-leader slots will go to young 
researchers recruited to work at the Crick for a maximum of 12 years — 
a 6-year initial stint and a 6-year extension, subject to a positive review. 
Researchers from the NIMR and the LRI who already have permanent 
posts will keep them, but as they retire, the proportion of tenured group 
leaders will shrink to around one-third of the total. The regular
staff turnover, says Nurse, will free the Crick to follow the latest research 
trends, be they stem cells or genome editing. “A constantly rejuvenating 
institute stops things from becoming too inward-looking and ossified.” 


LAB AS EXPERIMENT 

“Gentle anarchy” is how Nurse has described his vision for the Crick. 
Researchers there will not belong to departments or divisions, and their 
location in the super-lab will depend on the core facilities they tend 
to use, from drug-screening robots to cell sorters. In Nurse’s vision, 
researchers will form grass-roots interest groups on topics such as 
the microbiome. “Most people think it’s crazy — you have no sorts of 
structures,” says Nurse. “I think that it’s very exciting and invigorating 
because the structure comes bottom up.” 

Tom Cech, a Nobel-prizewinning biochemist at the University of 
Colorado Boulder and former president of the Howard Hughes Medical 
Institute (HHMI) in Chevy Chase, Maryland, says that science funders in 
other nations will watch the Crick closely. He has some experience with 
unusual ventures. During Cech’s tenure, the HHMI opened the Janelia 
Farm Research Campus, a neuroscience institute near Ashburn, Virginia, 
that kept group sizes small, jettisoned tenure and emphasized interdisciplinary
research. “If nobody does these experiments, the question remains
hypothetical.” Experiments like Janelia Farm, like the Crick, are important
for society, says Cech, because they can inspire new ways of doing science. 

The Crick’s supporters are also counting on the institute to be an 
economic engine for Britain by bridging the gap between the lab and 
market. Nurse’s number two, chief operating officer David Roblin, was 
head of European research and development at pharmaceutical giant 
Pfizer until 2011, and has since worked for smaller biotechnology com- 
panies. “Although I’m an industrial scientist, my instructions are not 
to make it a pharmaceutical company,” Roblin says. “The Crick is a
discovery research institute that is very interested in translation.” Roblin
is trying to lure pharmaceutical companies to place researchers at the
Crick in the hope of speeding the transition to the clinic. But unlike at
a private company, any developments would be openly reported. And
the intellectual property would belong to the Crick.

Paul Nurse inside the Francis Crick Institute.

Even in a city filled with glass and steel megastructures, the Francis
Crick Institute is an imposing landmark. Nurse, who was knighted in 
1999, says that his employees sometimes call it “Sir Paul’s cathedral”. 

Perched on the edge of the city’s revitalized King’s Cross area, the build- 
ing dwarfs the neighbouring British Library — the biggest public structure 
erected in Britain in the twentieth century. The Crick’s twin, curved roofs 
give the appearance of an overturned ship, its hull split in two. 

The institute feels even larger inside, where hard hats, safety glasses 
and steel-toed boots were still mandatory as of May. Hallways of glass- 
fronted open-plan labs stretch almost the length of two football fields 
on either side of an expansive atrium, over several lofty floors. To orient 
people, the walls are coloured by floor and a smartphone app is being 
developed (“I think we’re blue. I can’t remember,” says one scientist who
will move her lab to the Crick next year). Its four basement levels will 
house sensitive equipment such as electron microscopes, shielded by 
concrete slabs to block the ever-present vibrations from nearby train 
and tube lines. Construction has begun on a high-containment lab for 
working on influenza and other deadly pathogens, as well as facilities for 
a menagerie of lab animals, including the only collection of opossums 
used for research in the United Kingdom. 

Like many new laboratories, the Crick is designing spaces to encour- 
age interactions between researchers. Each above-ground floor centres 
around meeting rooms, tea and coffee stations and ‘collaboration spaces’,
meant to increase the odds of fruitful encounters. 

“That's one of the things I’m most excited about,” says James Briscoe, a
developmental biologist who is moving to the Crick from the NIMR. He 
has already met the group leaders with whom he will soon share bench 
space and equipment. Caetano Reis e Sousa, an immunologist coming
from the LRI, says that he, like most other principal investigators,
already works with colleagues elsewhere. It is the graduate students,
postdocs and technicians who will benefit most from the Crick’s social
engineering, he says. “They are the ones who will establish collaborations
and connections that will allow new things to be done.”

Vital signs

As Europe’s largest biomedical centre, the Francis Crick Institute looms large in
many dimensions, from the size of its budget to the number of its personnel.

Money matters
On top of its £700-million (US$1.1-billion) construction costs, the centre will have
an annual budget of £150 million.

Room to roam

Palace of science
The Crick dwarfs Buckingham Palace in several measures.

However, not everyone buys the argument that a giant fish bowl 
is needed for immunologists to have tea with physicists, the kind of 
interaction embodied by an unofficial mantra in many Crick docu- 
ments and videos: “discovery without boundaries”. The phrase grates 
on some researchers. “The assumption is, previously, discovery had terrible
boundaries,” says Kieron Flanagan, a science and technology policy
researcher at the University of Manchester, UK. “And now that they're 
all in one building, suddenly the discoveries will flow. It’s just bollocks.” 

The Crick’s size is another sticking point. “Everybody believes that 
science has to get bigger and more capital intensive,” says Flanagan. 
“There's a good bit of evidence that says small groups are more produc- 
tive than large ones.” 

And its location presents challenges. Building the Crick in one of 
Europe's biggest cities will be crucial to convincing the best young 
researchers from around the world to move to the United Kingdom, 
says Nurse. The city’s hospitals offer a population of potential clinical- 
trial participants not matched anywhere else in the United Kingdom. 

But the institute has little room to expand, and has built several 
facilities off site, including a data centre and animal-breeding facility. 
A planned trans-London underground railway line, proposed to pass 
near the site, could overwhelm the vibration-dampening protections 
and jiggle the institute's sensitive scientific instruments (see Nature 
518, 464–465; 2015). And then there is the cost of living in London,
which has skyrocketed since plans for the Crick were unveiled. Nurse 
had hoped to buy a block of flats for postdocs and graduate students, 
but he could not find the money. He is now working on smaller 
measures to make London more affordable for Crick scientists.

Gerald Rubin, executive director of the Janelia campus, sees the lack 
of subsidized housing as the institute's only weakness — but worries that 
it could hamper recruitment. “If the Crick had that, they would probably 
rapidly become the premier research institute in Europe. If they don't, 
I think they'll struggle,” he says. 


RIPPLE EFFECTS 

Nurse says that the Crick will serve as a scientific incubator for the 
United Kingdom: he hopes that researchers leaving the Crick will take 
their talents to universities throughout the country. “I say half joking 
that they should fall in love with somebody who lives here,” he says. 

Critics, however, say that Nurse is being glib about the serious eco- 
nomic challenges faced by young Londoners (a postdoc or even principal 
investigator's salary will not buy a flat anywhere near the institute). And 
they question whether the Crick will be the incubator that Nurse and the 
government contend. “I don’t see it as being possible to define that as a 
strategy for the institute — to populate the UK,” says David Stephens, a 
cell biologist at the University of Bristol. “It could, but there’s no de facto 
reason why it will.” Researchers could just as well move to world-class
centres in the United States, Germany or Asia, say Stephens and others. 

Some worry that the Crick will become ‘too big to fail’ — that its 
public and private funders will put the centre ahead of other priorities. 
Beyond its £700-million construction costs, the institute will have an 
annual budget topping £150 million per year (see ‘Vital signs’), which 
will be paid by the government’s Medical Research Council (MRC), 
CRUK and the Wellcome Trust. A flat or even diminished government 
science budget — increasingly likely after the election this year of a 
Conservative government that promises cuts — has amplified such 
concerns. “Taking a large scoop out of the pot for one institute makes it 
harder for the rest of us to attract funding,” says Stephens, whose lab is
supported by the MRC and the Wellcome Trust. Meanwhile, Nurse is 
currently leading a government review of how the UK research councils 
(which include the Crick’s major funder, the MRC) divvy up money, 
raising conflict-of-interest concerns. 

Nurse says that such complaints are unfounded “scare stories” from 
scientists who are “terrified about their own funding”. He sees the scale of 
the Crick as a strength, which will help the institute in terms of both
conducting world-leading research and securing financial support. “Having
a single large institute, which has been invested in and where people have
got a lot of skin in the game, is indeed a good encouragement to make the
thing work,” he says. “But it means we have a responsibility to deliver.”

But deliver what? Gauging the ultimate success or failure of the Crick 
will be difficult. The researchers who will soon move into the institute 
are already leading scientists, and will continue to produce high-level 
work. But that is not enough. Cech says that the Crick should be judged 
on whether it contributes to discoveries that could not have been made 
by university researchers with the same funds. Wilsdon agrees: “It’s 
going to have a lot of resources in it. It needs to do more than margin- 
ally outperform its competitors.” 

Nurse is another question mark. He plans to helm the Crick for 
another five or six years before stepping aside as director. Matthew 
Freeman, a molecular biologist at the University of Oxford, UK, won- 
ders whether a successor will be able to carry his vision forward. “My 
sense is this is very much designed in a way Paul wanted it to be. He's put 
a lot of his own personality and weight and charisma behind establishing
this rather unusual model,” says Freeman. “I see this as an exciting
experiment, but experiments can fail as well as succeed.” 

Nurse concedes that the Crick’s loftier goals — creating a nimble, 
highly collaborative basic-research institute that powers Britain's knowl- 
edge economy — are less than certain. “I think we've got a reasonable 
chance,” he says. But he is open to change. “What if my lack of departments
is chaos? If it’s chaos, I'll put the white flag up and say, right, we'll
do something else,” he says. “I am not a zealot.”


Ewen Callaway writes for Nature from London. 

OXYTOCIN


As researchers work out how oxytocin affects the brain, the hormone 
is shedding its reputation as a simple cuddle chemical. 


In April 2011, Robert Froemke and his team were
reprogramming the brains of virgin mice with a single 
hormone injection. 

Before the treatment, the female mice were largely indifferent 
to the cries of a distressed baby, and were even known to trample 
over them. But after an injection of oxytocin, the mice started
to respond more like mothers, picking up the mewling pup in their
mouths. Froemke, a neuroscientist at New York University’s Langone 
Medical Center in New York City, was monitoring the animals’ brains 
to find out why that happened. 

At first, the mice showed an irregular smattering of neural impulses 
when they heard the baby’s cries. Then, as the oxytocin kicked in, the 
signal evolved into a more orderly pattern typical of a maternal brain. 
The study showed in unusual detail how the hormone changed the 
behaviour of neurons. “Oxytocin is helping to transform the brain,
to make it respond to those pup calls,” Froemke says. 

Oxytocin has been of keen interest to neuroscientists since the 
1970s, when studies started to show that it could drive maternal behav- 
iour and social attachment in various species. Its involvement in a 
range of social behaviours, including monogamy in voles, mother-infant
bonding in sheep, and even trust between humans, has earned
it a reputation as the ‘hug hormone’. “People just concluded it was a
bonding molecule, a cuddling hormone, and that's the pervasive view 
in the popular press,” says Larry Young, a neuroscientist at Emory 
University in Atlanta, Georgia, who has been studying the molecule 
since the 1990s. 

That view has led some clinicians to try oxytocin as a treatment for 
psychiatric conditions such as autism spectrum disorder. But the early 
trials have had mixed results, and scientists are now seeking a deeper 
understanding of oxytocin and how it works in the brain. Researchers 
such as Froemke are showing that the hormone boosts neuronal signals 
in a way that could accentuate socially relevant input such as distress
calls or possibly facial expressions. And clinical researchers are starting 
a wave of more ambitious trials to test whether oxytocin
can help some types of autism. 

The work is leading to a more sophisticated view of the 
hormone and its complex effects on behaviour — one that will take 
many types of expertise to refine. “The oxytocin field has just matured 
and ripened enough to draw in researchers from traditionally separate 
fields, catapulting this forward,” says Young.


BIRTH ACCELERATOR 

Oxytocin’s story starts back in the early 1900s, when biochemists 
discovered that a substance from the posterior pituitary gland could 
promote labour contractions and lactation. When scientists later dis- 
covered the hormone responsible, they named it oxytocin after the 
Greek phrase meaning ‘rapid birth’. Oxytocin is produced mainly by
the brain’s hypothalamus; in the 1970s, studies revealed that oxytocin- 
producing neurons send signals throughout the brain, suggesting that 
the hormone had a role in regulating behaviour. 

In a landmark 1979 study, Cort Pedersen and Arthur Prange at the
University of North Carolina in Chapel Hill showed that giving oxytocin 
to virgin rats could trigger maternal behaviours: the animals would build 
nests, lick or crouch over unfamiliar pups and even return lost pups to 
the nest. Researchers went on to show that oxytocin signalling in the 
brains of prairie voles (Microtus ochrogaster) helps the animals to form 
lifelong pair bonds, a rarity among mammals. In 2012, researchers
even found a version of oxytocin in the tiny roundworm Caenorhabditis
elegans, where it helps the animals find and recognize mates.

“This is a very ancient molecule,” says Sue Carter, a neuroscientist
at Indiana University in Bloomington, whose lab pioneered many of 
the early studies of oxytocin in voles. “It has been used and reused for 
many purposes across the evolution of modern animals, and almost 
everybody who’s tried to look at an effect of oxytocin on anything like
social behaviour has found something.”

In mammals, many mysteries remain. Oxytocin is difficult to 
measure reliably in the brain, making it hard to know
exactly where, when and how much is normally 
released; nor do scientists understand precisely how it 
works to alter behaviour. “What we need to start think- 
ing about is the more fundamental role that oxytocin 
plays in the brain,” Young says. The determination to 
find out has been strengthened by a growing move in 
neuroscience to characterize circuits that are important 
in brain operations. “That's the level that’s critical for 
understanding how the brain is regulating behaviour,”
says Thomas Insel, director of the US National Insti- 
tute of Mental Health in Bethesda, Maryland, who has 
studied oxytocin in voles. 

At Langone, Froemke focused on the circuits under- 
lying the maternal response to pup cries — a behaviour 
that helps females to retrieve helpless newborns that can get lost when 
a mother is moving her nest. He focused on the left auditory cortex, a 
brain area thought to be involved in detecting the pups’ ultrasonic cries.

Froemke’s study, published in April, showed that oxytocin
temporarily suppresses inhibitory neurons — those that dampen neural 
activity — which allows excitatory cells to respond more strongly and 
reliably. “Our hypothesis is that the virgin brain is a blanket of inhibi- 
tion, and that pairing the pup calls with oxytocin allows the network 
to be reconfigured,” says Froemke. The hormone may serve to amplify 
incoming signals and allow them to be recognized as behaviourally 
important. (It is at least possible, he says, that this same mechanism 
could explain why some human mothers feel they are uniquely tuned 
to a baby’s cries.) 

“The study is kind of a high-water mark for the field, putting differ- 
ent levels all together: a robust behaviour, a brain region, and a cellular 
basis for it,” says Richard Tsien, a neuroscientist also at Langone. Tsien
has been studying the action of oxytocin on neuronal circuits in detail, 
by examining slices of the hippocampus, a region involved in learning
and memory. In a 2013 study of rats, Tsien’s team found
that oxytocin selectively acts on a type of cell called an 
inhibitory interneuron in a way that quiets background 
chatter within the neuronal circuit. “Oxytocin improved 
signal transmission, almost doubling the ability of 
information to flow through the system,” Tsien says. In 
effect, it is producing more signal and less noise.

Froemke’s and Tsien’s work fits into a broader theory:
that one way oxytocin helps social interaction and recognition
is by enhancing the brain’s response to socially
relevant sights, sounds or other stimuli. Young has
shown that the hormone helps mice to recognize and
pay attention to the smells of other mice; others found
that it promotes people’s ability to recognize faces.

The hormone does not act alone. In 2013,
neuroscientist Robert Malenka at Stanford University in California and 
his colleagues showed that oxytocin works together with the neuro- 
transmitter serotonin to reduce the excitability of neurons in the nucleus 
accumbens, a brain region involved in reward. This process seems to
support the preference of mice to return to environments where they had
rewarding social interactions with other animals. “Oxytocin is part of a
system,” Carter says, “and it’s not the only molecule that matters, but it’s
one that in some way is regulatory over a large number of other systems.” 


MATTER OF TRUST 

The rapid evolution in basic research has been accompanied by a boom 
in clinical interest. Oxytocin has been used since the 1950s to acceler- 
ate childbirth, so many researchers consider it relatively safe to use in 
experiments. 

About ten years ago, psychology studies started to show that single 
doses of oxytocin, delivered through an intranasal spray, could pro- 
mote various aspects of social behaviour in healthy adults. People who 
inhaled oxytocin before playing an investment game were more willing 
to entrust their money to a stranger than were placebo-treated players.
A dose of the hormone also increased the amount of time that people
spent gazing at the eye region of faces, and improved their ability to
infer the emotional state of others from subtle expressions.

The idea that oxytocin is central to social cognition made it an 
attractive candidate for treating psychiatric disorders, especially autism 
spectrum disorder. People with this condition, who often have problems 
with social interaction and communication, may not 
process social stimuli appropriately — and scientists 
theorized that oxytocin might reverse some of the 
symptoms. Beginning in 2010, results emerged that 
seemed to support this theory: researchers found that 
single puffs of oxytocin could temporarily improve 
measures of empathy and social cooperation in peo- 
ple with autism spectrum disorder. 

“People got quite excited,” recalls clinical neuroscientist
Evdokia Anagnostou, who co-directs the
Autism Research Centre at Holland Bloorview Kids 
Rehabilitation Hospital in Toronto, Canada. But 
Anagnostou says that some preliminary steps were 
skipped over as researchers rushed to test oxytocin as 
a psychiatric drug. “To be honest, if we had done it properly, we wouldn't 
have done it the way we did. It went a little bit too fast,” she says. Because
oxytocin had cleared the early, standard steps of drug development dec- 
ades earlier, some researchers did not systematically test a range of doses 
to see whether they had differing psychological effects. 

Many early studies of oxytocin for autism were limited because they 
assessed only a single dose and had relatively few participants, and later 
experiments with more doses failed to show the same promise. In 2010, 
clinical psychologist Adam Guastella at the University of Sydney in Aus- 
tralia studied 16 male adolescents with autism spectrum disorder, and 
found that one dose of oxytocin could improve their ability to gauge the 
emotions of others by looking at their eyes. But when he tried giving
twice-daily doses of the hormone for two months, he found no significant
improvements in social interaction or social cognition. “Studies
to this point have really shown limited benefit of oxytocin in improving 
psychiatric illnesses over time,” he says. Guastella says that getting to the 
bottom of oxytocin’s complex neurological effects will take time. “If we 
want a simple answer, we're not going to get it.” 


ALL IN THE DETAIL 

So far, few studies have definitively linked autism to problems in 
oxytocin signalling. Some of the clearest evidence emerged in February, 
from a team led by neurogeneticist Daniel Geschwind of the University 
of California, Los Angeles. The group showed that mice that lacked a 
working copy of the Cntnap2 gene — which has been implicated in a 
small subset of human autism cases — had fewer oxytocin-containing 
neurons in the hypothalamus and socialized less with other mice than 
did control mice. After receiving doses of oxytocin every day for two
weeks, the mice behaved normally again. “Until this, there was no evi- 
dence that there was a subtype of autism that had to do with oxytocin 
deficits,” Geschwind says. 

His study points to a more targeted approach in the clinic. “Autism is 
highly heterogeneous, but if you can find subsets of individuals — those 
who have oxytocin-signalling deficits — they may be the best candidates 
for oxytocin therapy,” says Karen Parker, a behavioural neuroscientist
at Stanford. 

A handful of large-scale clinical trials are now getting under way to 
test oxytocin and oxytocin-based therapies for autism spectrum disorder,
and to work out who could benefit. Linmarie Sikich, a child psychiatrist
at the University of North Carolina, is heading the largest of
these trials. Sikich plans to recruit 300 people with autism spectrum 
disorder, ranging in age from 3 to 17, and give them 6 months of either 
oxytocin or a placebo, followed by 6 months in which everyone will 
receive oxytocin. 

Unlike previous studies, the trial will include people with a wide range 
of symptoms — and one of its major aims is to uncover the set of factors
that influence whether and how strongly people respond to oxytocin. 
Sikich will analyse many measures of cognition and social function- 
ing, and collect blood samples to look for biomarkers — such as levels 
of oxytocin and the receptor it binds to — that are associated with a 
response. “Lin has really been trying to create conditions under which 
you could study the potential beneficial effects of oxytocin and really 
do this right,” says Carter. 

But Carter and other scientists are concerned by 
reports from the physicians and parents of children 
with autism spectrum disorder who say that they 
are already using oxytocin off-label — before it has 
been thoroughly tested. “We do not understand how 
the hormone works yet, or have enough informa- 
tion about what happens when it’s given repeatedly,” 
Carter says. “This is not a molecule that people should 
be self-administering or playing with.” 

Some work has pointed to a potential dark side to 
oxytocin. Carter’s group found that a single low dose 
of the hormone given to baby prairie voles improved 
their pair bonding as adults, but that higher doses 
interfered with that behaviour — possibly because oxytocin started to 
activate other receptors. And human studies have suggested that in certain
contexts, a puff of oxytocin can cause people to be more aggressive
in defending themselves against outsiders or competitors. In patients
with a psychiatric condition known as borderline personality disorder, a
single dose of oxytocin has been found to hinder trust and cooperation.

Young says that the oxytocin field would benefit from closer col- 
laboration between basic and clinical researchers. If basic scientists can 
work out how oxytocin helps the brain to process social stimuli, then 
that might help in the design of stimuli — in the form of behavioural 
therapies — that could be given alongside the hormone to change 
behaviour, just as oxytocin and pup calls together affect virgin mice. 
“I think in the future these two branches need to have more communication,”
Young says.

But long before that, say researchers, oxytocin could use a rebranding. 
“It doesn’t induce love; it doesn’t induce massive amounts of trust,”
Guastella says. “The problem we've got ourselves into is that we're trying
to look for a simple answer: either oxytocin does or does not work
in a patient population, or it does or does not enhance a certain social
process.”

But the science of life is rarely as simple as that. “Oxytocin is known 
to affect circuits in different ways, and it’s not going to affect everyone 
in the same way,” Guastella says. “The sorts of biology we're studying
here are incredibly complex.”


Helen Shen is a science writer based in Sunnyvale, California. 
Science can’t solve it


Democratically weighing up the benefits and risks of gene editing and artificial 
intelligence is a political endeavour, not an academic one, says Daniel Sarewitz. 


This year, several leading researchers
sounded warnings about the
risks of using the CRISPR gene-editing
technique to modify human and other
species’ genomes in ways that could have 
“unpredictable effects on future generations” 
and “profound implications for our relation- 
ship to nature” (see go.nature.com/jq5sik). 

Concerns are coming from the silicon 
sector as well. Last year, the physicist Stephen 
Hawking proclaimed that rapidly advanc- 
ing artificial intelligence (AI) could destroy 
the human race. And in 2013, former Royal 
Society president Martin Rees co-founded the 
Centre for the Study of Existential Risk at the 
University of Cambridge, UK, in part to study 
threats from advanced AI. 

Leaders of the scientific community are 
ready to share the responsibility for these 
powerful technologies with the public. 
George Church, a geneticist at Harvard 
University in Cambridge, Massachusetts,
and others wrote last year of CRISPR that
“the decision of when and where to apply 
this technology, and for what purposes, will 
be in our collective hands”. 

But scientists also want to control the 
terms of engagement. The US National 
Academies, for example, will “guide deci- 
sion making” by convening researchers and 
other experts later this year “to explore the 
scientific, ethical and policy issues associ- 
ated with human gene-editing research”. 
Scientists also emphasize the need for more 
research on risks and benefits to “better 
inform future public conversations”. For
instance, in the past few months, hundreds 
of scientists and technologists have signed 
an online open letter arguing that research 
is necessary to learn how to accentuate the
positive aspects of AI and avoid its potential
perils (see go.nature.com/jcyjib). 


WHO VALUES WHAT 

The idea that the risks, benefits and ethical 
challenges of these emerging technologies are 
something to be decided by experts is wrong- 
headed, futile and self-defeating. It misunder- 
stands the role of science in public discussions 
about technological risk. It seriously under- 
estimates the democratic sources of science’s 
vitality and the capacities of democratic delib- 
eration. And it will further delegitimize and 
politicize science in modern societies. 

The never-ending debates about geneti- 
cally modified (GM) organisms, nuclear 
power, chemical toxicity and the efficacy of 
cancer screening should be evidence enough 
that science does not limit or resolve contro- 
versies about risk. 

There is no way to capture the full 
complexity of these issues from a
scientific perspective. When new technolo- 
gies are introduced into complex socio- 
technical systems, everyone is ill-informed 
about the risks. Differing bodies of evidence 
provide ammunition for competing views. 
Legitimate experts are always available to 
support conflicting preferences. 

For example, an agricultural economist 
(concerned about crop yield) and an ecologist 
(concerned about ecosystems) will bring different
sets of evidence, and probably entirely
different values, to bear on studying the 
impacts of GM organisms. Even among agri- 
cultural economists, some researchers prefer 
field trials that allow for the careful control of 
variables such as weather and soil type; oth- 
ers study actual farms to capture real-world 
variability. These two perspectives often yield 
contradictory results.

To many European consumers, moreover, 
research on crop yields is irrelevant. They are 
concerned with the motives behind corporate
decisions about crop varieties, aesthetic
qualities of landscapes and food varieties, 
and principles of choice and transparency 
that would demand the labelling of GM 
foods even if there is no known health risk. 

In other words, risk is more a political and 
cultural phenomenon than it is a technical 
one. Turning its framing over to scientists 
and other privileged experts, such as ethi- 
cists and social scientists, is to turn politics 
and culture over to them as well. 

Scientists are not elected. They cannot 
represent the cultural values, politics and 
interests of citizens — not least because their 
values may differ significantly from those of 
people in other walks of life. A 2007 study
on the social implications of nanotechnology, 
for instance, showed that nanoscientists had 
little concern about such technologies elimi- 
nating jobs, whereas the public was greatly 
concerned (see ‘A matter of perspective’). 
Each group was being rational. Nanoscien- 
tists have good reason to be optimistic about 
the opportunities created by technological 
frontiers; citizens can be justifiably worried 
that such frontiers will wreak havoc on labour 
markets. 

Opening up questions of risk to democratic 
debate is on the whole good for science and 
innovation. The physicist Alvin Weinberg, 
a strong advocate for nuclear power, recog- 
nized this in the 1970s. Weinberg noted that 
the public debate of “questions like the prob- 
ability of a reactor accident runs the risk of 
introducing exaggeration and distortion”. Yet
he also recognized that public pressure in the 
United States led to much greater attention to 
reactor safety than in the Soviet Union, where 
the public did not have a right “to participate 
in scientific and technological debate”.

A MATTER OF PERSPECTIVE

Surveys conducted in 2007 suggested that
nanoscientists had different ideas from the
general public about the main risks of
nanotechnology.

Different cultural and political approaches
to choosing and managing risks invite different
approaches to problem solving. Having
rejected nuclear power, Germany is becoming
a demonstration project for renewable-energy 
technologies, even as its neighbour France has 
shown how nuclear can provide an alterna- 
tive low-carbon energy system. Opposition 
to GM was described as “a form of madness” 
by former European Commission science 
adviser Anne Glover, but it is part of a broader 
consumer movement that stokes demand for 
large-scale organic farming, integrated pest 
management, reduced use of antibiotics and 
reduced consumption of beef. Such prefer- 
ences open up alternative innovation path- 
ways that can add diversity and resilience to 
the global food system. 

These ongoing debates show that the 
scientific community's efforts to wrest con- 
trol over the specification of technological 
risk have not worked. Instead they have 
undermined the legitimacy of science. 

As new areas of contentious technology 
emerge, the way out of this situation is to 
let democratic deliberation lead the way in 
determining which values and world views 
ought to be protected and which sacrificed. 


WORLDWIDE VIEWS 
If an informed public discussion is needed,
then let's have one. The capacity of people to 
learn about and deliberate wisely on the tech- 
nical aspects of complex dilemmas has been 
documented by social scientists for decades.
One model for how such discussions can 
be organized on an international scale has 
been developed by the World Wide Views 
(WWV) alliance, coordinated by the Danish
Board of Technology. Since 2009, WWV
has convened deliberations among diverse 
groups of about 100 citizens at numerous 
sites around the world — on global warm- 
ing, biodiversity, and, earlier this month, 
climate and energy. Thousands of people 
have participated from all corners of society.
(In Washington DC, a WWV group discuss- 
ing biodiversity included a homeless person, 
a roofer and a physicist.) 

Each WWV deliberation is held across 
the world during a single day. Participants 
are provided with the same written and 
video background material (vetted by expert 
panels) on the issue being discussed. The day 
is divided into four or five thematic sessions; 
participants, in moderated groups of five to 
eight people, discuss a set of questions for each 
theme, then vote on relevant policy choices. 
It is too early to assess its actual impact on 
policy, but WWV demonstrates the viability 
of large-scale, representative deliberation on 
complex matters of global import. 

Institutional models are also emerging 
that involve the public directly in choices 
about research that could influence the very 
nature of human existence — what might 
be termed the sciences of the existential. In 
the United States, for example, NASA last 
year commissioned the Expert and Citi- 
zen Assessment of Science and Technology 
(ECAST) network, to convene public delib- 
erations on options for asteroid detection, 
mitigation and recovery. The results are 
informing agency decisions. 

If the sciences of the existential are at 
hand, then let’s make decisions about them 
collectively. WWV-type deliberations could 
address questions about what is acceptable 
and what isn't, about appropriate governance 
frameworks for research, and about the rela- 
tive priority of different lines of study given 
ongoing and inevitable uncertainties and 
disagreements about risks and benefits. 

This sort of discussion should continu- 
ally feed into and set the boundary condi- 
tions for expert panels. A truly deliberative 
process that is geographically distributed 
and demographically inclusive can reveal 
the variations in how risks are selected and 
prioritized in different places and cultures. 
Values, governance regimes and research 
agendas can co-evolve in response to such 
knowledge. Democracy and science will 
both be better off.


Daniel Sarewitz is professor of science and 
society and co-director of the Consortium 
for Science, Policy and Outcomes at Arizona
State University, Tempe, Arizona, and is 
based in Washington DC, USA. 
Move beyond differences


Researchers and ethicists need to see past what can seem to be gendered debates 
when it comes to the governance of biotechnology, says Charis Thompson. 


This autumn, researchers and other
experts will come together to dis- 
cuss the scientific, ethical and 
policy issues associated with gene-editing 
research in humans. Plans for the inter- 
national meeting were announced by the 
US National Academy of Sciences and the 
National Academy of Medicine after a 
study was published in which researchers 
used a gene-editing tool known as CRISPR 
to modify the genomes of non-viable 
human embryos.

Whether this meeting and others like it, 
planned in the United States, can help to 
forge a path for gene editing that takes into 
account all the relevant needs and concerns 
will depend on what efforts are made to 
integrate the diverse perspectives of people 
with different expertise and values. A first 
step to such integration is understanding 
how different perspectives arise. 

One division in cares and concerns seems 
at times to fall along stereotypical gender 
lines. This was powerfully demonstrated 
during a meeting in Atlanta, Georgia, last 
month on biotechnology and ethics. About 
200 global thought leaders gathered at 
BEINGS 2015 to “reach consensus on the 
direction of biotechnology for the twenty- 
first century”. On the day I attended, it was 
generally men who focused on containing 
biosecurity threats and on how to pre- 
vent regulation from impeding research. 
Women raised concerns about eugenics 
and class, race and gender inequalities in 
relation to biotechnology. Women were also 
the ones discussing the environment, the 
future of humanity, and the possible harms 
to the people who supply materials such as 
tissues and eggs, on which advances in bio- 
technology rely. 

People at the conference quipped that 
this gendered divide reflected the follow- 
ing: men are pro-science and women are 
pro-ethics; men draw on rational crite- 
ria to support their arguments whereas 
women draw on emotional ones; men are 
interested in tangible, pragmatic issues 
whereas women are interested in values 
and deep ethical thought. 

All of these ideas are reductive and sexist. 
They fail to recognize that people — what- 
ever their gender, race or class — generally 
focus on pragmatic and measurable solu- 
tions to the problems they find the most 
pressing. People just differ on which issues 
they think are the most important. 


That men and women at BEINGS 
tended to focus on different concerns 
reflected underlying differences of opin- 
ion about what needs to be taken into 
account when considering whether and 
how a particular area of science should 
proceed. These underlying differences are 
themselves gendered, because gender still 
affects which fields and subfields people 
enter, who does what kind of professional 
and domestic work, and where people’s 
political concerns lie.

Take questions of framing, governance 
and jurisdiction. When framing the problems,
people in […] cal advances. The
people in this camp generally believe
biotechnology (and science in general) 
should primarily be self-policing. They also 
tend to perceive non-experts and scholars 
with different areas of expertise as outsiders 
who slow down or stifle research. 


For those in the other camp, the worth of
scientific breakthroughs should be judged
in a broader social context: addressing cer- 
tain problems, such as entire populations 
being medically underserved, should be an 
integral part of the responsible develop- 
ment of biotechnology. People in this 
camp also tend to believe that the govern- 
ance of biotechnology should be shaped 
according to how and where biotechnol- 
ogy is developed and deployed, for whom 
and at what cost — and that a greater 
diversity of experts, and many more non- 
experts, should be involved. 

Obviously, there is not a clean two- 
gendered split, with men neatly falling into 
one camp and women into the other. Mul- 
tiple genders, sexualities and family struc- 
tures are increasingly being recognized 
worldwide, biologically and politically. 
Numerous scholars have shown that the 
effects of a person's gender on their choice
of work, politics and so on can be under- 
stood only if their class, race, sexuality, 
disability and citizenship are also consid- 
ered (see, for example, ref. 3). Nonethe- 
less, differences in approaches to bioethics 
come out in a gendered way because the 
factors that shape views of the appropri- 
ate scope and nature of science governance 
are themselves gendered. 

Currently, there is a tendency for some 
scientists, bioethicists, lawyers, patient 
advocates and others to regard as ‘woolly’ 
people who are concerned with issues such 
as health disparities. Equally, those who 
take a broader view of biotechnology’s 
responsibility perceive some scientists 
as too caught up in the monetization and 
competition of their fields. These views 
often manifest as a perceived deficit in 
rationality in the other camp. 

When scientific developments throw 
up difficult choices, scientists, social 
scientists and others need to stop posi- 
tioning other people as being for or against 
science, or for or against ethics. 

Mechanisms are needed to ensure 
that neither camp dominates the other. 
Following the autumn meeting, the 
National Academy should establish work- 
ing groups to study and document at regular 
intervals the wide range of phenomena that 
will shape and be shaped by gene-editing 
research — from scientific breakthroughs 
to health disparities and disability justice. 

If specialist pursuit of certain issues in 
such working groups is combined with 
efforts to bring the diverse strands of 
understanding together, for instance in the 
same publications, we might end up with 
better science and better ethics.
The El Toro II glacier on the Chile-Argentina border, near the Pascua Lama gold mine.

Legislation left out in the cold


Mark Carey examines the cautionary tale of Argentina’s struggle to pass the 
world’s first glacier-protection law.


The human-driven ‘big melt’ in the
cryosphere is a major concern, with
issues such as reduced glacier runoff
riddling headlines. Glaciers are crucial,
scientifically, socially and economically: they
affect sea level, feed rivers, yield some of the
world’s most precise climate data in ice cores,
and draw tourism to icy landscapes. But as
Jorge Daniel Taillant explains in Glaciers, few
people are actually doing anything about the
melt — except in Argentina, where in 2010
authorities implemented what he refers to as
the world’s first glacier-protection law.

The rocky path towards creation of that 
law started in 2006, when Argentina’s secre- 
tary of the environment (and Taillant’s wife) 
Romina Picolotti learned that the Barrick 
Gold mining company of Toronto, Canada, 
had dynamited three glaciers at its Pascua 
Lama mine on the Chile-Argentina border 
to access a gold seam. What followed was a 
struggle to create and implement a national 
law to save glaciers from the damaging 
impacts of mining, hydrocarbon explora- 
tion and industrial development. Taillant 
refers to the quest as 
cryoactivism, a term 
coined by French 
glaciologist Bernard 
Francou. 

Glaciers: The Politics of Ice
JORGE DANIEL TAILLANT
Oxford Univ. Press USA: 2015.

Glaciers is a blow-by-blow account
of progress towards the enactment and implementation
of the law. It details both Picolotti’s glacier-saving activism
and Taillant’s own, through the Center for Human Rights
and Environment (CEDHA) — a non-governmental organization
in Córdoba, Argentina, of which he is executive director.
As a result, the book
is largely an autobiographical account. But it 
offers discussion of glacier features, rock gla- 
ciers and the periglacial system, brief accounts 
of glacier shrinkage around the world and 
a chapter that includes an assortment of 
vignettes about human-glacier dynamics — 
from the creation of artificial glaciers in India 
to the 5,300-year-old mummy Ötzi, found in
ice on the Austrian-Italian border. 

Taillant notes that in 2008, Picolotti ush- 
ered the Minimum Standards Law for the 
Protection of Glaciers and the Periglacial 
Environment through Argentina’s Congress 
with a unanimous vote in favour. The law 
sought to conserve glacial and periglacial 
environments as hydrological resources,
appeasing environmentalists and anti- 
mining activists who sought to save such 
resources for agriculture and potable water. 
But two weeks later, President Cristina 
Fernández de Kirchner vetoed the law.
Taillant speculates that “it is more than 
likely” that Barrick Gold president Peter 
Munk intervened. He bases his argument on 
three things: the law directly threatened bil- 
lions of dollars of gold buried at Pascua Lama; 
Munk had “a one-on-one relationship with 
the Kirchners”; and Argentina's mining prov- 
inces (which might otherwise have been sus- 
pected of providing the opposition) remained 
supportive of the law. In 2010, several prov- 
inces enacted their own glacier-protection 
laws; Santa Cruz was “the first jurisdiction in 
the world” to adopt one. Later that year, envi- 
ronmental activists got support from Sena- 
tor Daniel Filmus and Congressional Deputy 
Miguel Bonasso to push a new national gla- 
cier-protection law through the Congress: this 
one defined glaciers as “strategic freshwater 
reserves” for Argentina. Kirchner approved it. 
Taillant shows how enforcement of this 
law has had a bumpy ride. A national gla- 
cier inventory, crucial for identifying what 
needs protection, was mandated but remains 
unfinished. In 2010, in an effort to stand up 
to what he sees as mining-company power, 
Taillant enrolled in a one-week glaciology
course. He has since worked through
CEDHA to identify glaciers using satellite 
imagery, and to conduct observations in 
mining regions. He strives to pinpoint glacier
‘hotspots’, where protection is lacking or
mining claims or roads are a threat. Lower- 
level Argentinian courts have twice ruled to 
uphold the glacier law. 

Although it is informative on Argentina's 
environmental policy, Taillant’s book misses 
the mark on some of the science — espe- 
cially when compared to glaciology texts or 
books such as Mariana Gosnell’s Ice (Univ. 
Chicago Press, 2007) and Mark Bowen's Thin 
Ice (Henry Holt, 2005). When it discusses 
glaciers as water sources, the book does not 
precisely quantify the contributions of gla- 
cier runoff to downstream water supplies. It 
suggests that all glacial ice translates directly 
into water for human consumption, which 
exaggerates glacial input and runs contrary to 
studies that reveal that glaciers contribute up 
to 30% of downstream water in Peru and 27% 
in Bolivia, with sharp variations between dry 
and wet seasons. Nor does Taillant explain 
how groundwater, precipitation and land-use 
practices influence hydrology in glacierized 
watersheds. The book's claim that perennial 
snow is the same thing as a glacier, and reference
to “glaciers” that are only 3 metres thick
(and therefore lack internal ice deformation, 
a defining characteristic) will raise eyebrows 
among glaciologists. When referring to 
diminishing glacier runoff in the Himalayas, 
the book cites treehugger.com — and at other 
points, Wikipedia. 

Taillant argues that glacial contributions to 
hydrology are the basis for the glacier-protec- 
tion law. But we do not know what proportion 
of Argentina's downstream water they supply, 
during what season, or how that varies from 
glacier to sea. Without quantifying or fully 
explaining these contributions, the law and 
this book seem motivated by cultural values 
about endangered ice, rather than by scientific 
data. This tension is one that I have analysed 
before and that has been tackled by authors 
including Julie Cruikshank in Do Glaciers 
Listen? (Univ. British Columbia Press, 2005) 
and Eric Wilson in The Spiritual History of Ice 
(Palgrave Macmillan, 2009). Environmental 
laws, including those on glacier protection, 
evolve in the fraught national-to-global 
context of competing agendas, from the cor- 
porate to the governmental to that of envi- 
ronmental activists. As such, they frequently 
speak for, rather than with, local residents — 
who, as Taillant himself argues, have a human 
right to glaciers.
Books in brief


Life’s Greatest Secret: The Race to Crack the Genetic Code 
Matthew Cobb PROFILE (2015) 

James Watson’s The Double Helix (Athenaeum, 1968) and Horace 
Judson’s The Eighth Day of Creation (Touchstone, 1979) are classics 
on the 1953 discovery of DNA’s structure, and the research that 
consolidated it. Zoologist Matthew Cobb richly recontextualizes the
tale, tracing the interplay between biology, chemistry and physics that 
led to and amplified the breakthrough. This is a lucid explication of 
the science and the stories of key players, from X-ray crystallographer 
Rosalind Franklin and physicist Max Delbrück to Oswald Avery, who
linked DNA to genes, and information theorist Claude Shannon. 


Foragers, Farmers, and Fossil Fuels: How Human Values Evolve 
Ian Morris PRINCETON UNIVERSITY PRESS (2015)

Energy capture — as a “brute material force” — shapes human
values, argues archaeologist Ian Morris in this global-scale study of
cultural variation. Looking at modes of capture from hunting and 
gathering to the agricultural and industrial revolutions, he avers that 
each determines population size, social organization and values. 
Now, writes Morris, our globalized society faces a “Malthusian 
collapse” — and a lurch towards new values — driven by nuclear 
weapons, climate change and digitization. With contributions from 
writer Margaret Atwood, sinologist Jonathan Spence and others. 


The Naked Surgeon 

Samer Nashef SCRIBE (2015) 

UK consultant cardiac surgeon Samer Nashef joins the swelling 
ranks of medics who have penned frank inside stories. Piquant detail 
abounds: operating on the heart, for instance, involves “twisting
it, pressing on it, and sometimes turning it upside down”. But it is
Nashef’s long study of risk that injects nuance. It began in 1977, when 
he discovered that arterial surgeons were responsible for the worst 
outcomes in a sample of abdominal aortic aneurysm operations. 
Such failures have, he shows, driven quality measurement in 
medicine, including his own heart-surgery risk model, EuroSCORE. 


Pig Tales: An Omnivore’s Quest for Sustainable Meat 

Barry Estabrook W. W. NORTON (2015) 

Cheap pork should give us pause, writer Barry Estabrook argues 
in this cogent, level-headed investigation of the pig as raised
and researched. The animal is one of the most intelligent ever
domesticated, yet some 97 million in the United States are packed 
into reeking factory farms, with widespread ‘crating’ of pregnant 
sows. Yet as Estabrook reveals, humane, state-of-the-art husbandry 
is within our grasp and ultimately more profitable. He introduces 
an impressive group of animal scientists, farmers and pigs, from 
computer-savvy lab piglets to havoc-wreaking feral hogs. 


The Technical Image: A History of Styles in Scientific Imagery
Editors Horst Bredekamp, Vera Dünkel and Birgit Schneider
UNIVERSITY OF CHICAGO PRESS (2015)

This multidisciplinary study trains an art historian’s eye on historical
scientific imagery. Editors Horst Bredekamp, Vera Dünkel and Birgit
Schneider draw on research from the Humboldt University of Berlin
and a range of haunting images. They show that an iconic 1896
radiograph of a hand by X-ray discoverer Wilhelm Röntgen prompted
both rhapsodies over a “photography of the invisible” and frustration
among medics struggling to use such images for diagnosis. Barbara Kiser
COMMENT | BOOKS & ARTS


In Disaster Playground, firefighters train for an asteroid impact. 


Space-rock alert 


Alexandra Witze watches a pair of films on asteroids — 
according to many, a vast accident waiting to happen. 


Disaster Playground
DIRECTOR: NELLY BEN HAYOUN
Nelly Ben Hayoun Studio: 2014.


51° North
DIRECTOR: GRIGORIJ RICHTERS
Films United: 2015.


Step aside, Armageddon and Deep
Impact: two films are in competition
to be this generation’s seminal asteroid
movie. Both offer crucial information about
the asteroid threat to new audiences.

Disaster Playground and 51° North are
released widely on 30 June, the inaugural
celebration of Asteroid Day, the anniversary of
the 1908 meteorite explosion over Tunguska,
Siberia. The event is meant to spread aware-
ness of the threat of space rocks, repackaging
the alarms that a small group of skywatching
scientists have been issuing for decades. The
novelty comes in how these films do it.

Both explore the ramifications of what
might happen if an asteroid were discovered
on a collision course with Earth. Both feature
a narrator who interacts with real-life aster-
oid experts in a variety of offbeat settings.
Both are a far cry from conventional docu-
mentaries, being more mash-ups of scientific
information with fictionalized narratives.

Disaster Playground, directed by Nelly Ben
Hayoun, is a visually arresting roller-coaster
ride through California and the American
Southwest, featuring interviews with high-
impact scientists from NASA, the Sandia
National Laboratories and elsewhere. It opens
in Texas, with a cowboy and his horse startled
by a huge blast. Ambulances and firefighters 
race through the debris left by the impact. 
Within minutes, meteor scientist Peter 
Jenniskens of NASA’s Ames Research Center 
in Moffett Field, California, is presiding at an 
official-looking podium, sombrely reading a 
statement about the disaster. 

In real life, Hayoun has the impressive title 
“designer of experiences at the SETI Institute”,
the organization in Mountain View, Califor- 
nia, that explores life's place in the Universe. 
The film is certainly a designed experience. 
Hayoun ferries a bright-green toy dinosaur 
and a huge red telephone from interview 
to interview to represent the threat of mass 
extinction and the alerts needed to save the 
world from apocalypse. She uses offbeat 
camera angles, a pulsing soundtrack from 
rave band The Prodigy and title screens with 
giant capital letters to create a frenetic pace. 
Hayoun coaxes scientists into the closest
thing they may ever achieve to performance 
art. Ames astronomer David Morrison 
recreates his influential 1993 speech to Con- 
gress on the asteroid threat; his boss, Pete 
Worden, breaks out a Viking helmet and 
shield as a backdrop for an otherwise staid 
office interview. Hayoun’s own appearances 
bind it all together as she hurries from shoot 
to shoot, pausing only to buy a pair of cowboy 
boots on her way to explore Disaster City, an 
emergency-response training centre in Texas, 
where search and rescue teams work their way 
through intentionally collapsed buildings. 
There is no time to wait when disaster strikes. 

Grigorij Richters’ 51° North is similarly 
kinetic but much more self-absorbed. It 
centres on a fictional YouTube star called 
Damon, who becomes obsessed with the 
asteroid threat. His reality-show audience is 
gradually alienated as he turns from taping 
his girlfriend and his dogs to making show 
after show about space-rock disasters. That all 
changes when — well, no spoilers. But given 
the title and the fact that it is about asteroids, 
you can guess what happens to London. 

Richters has created a true millennial 
approach to the asteroid threat. Damon is 
constantly on camera: taping, uploading, 
tweeting, instagramming, sharing every 
moment. Brian May, Queen guitarist and 
astrophysicist, provides the soundtrack. 

There is some science. Damon interacts 
with many of Britain’s leading asteroid 
experts, and even tours the Spaceguard 
Centre in Knighton, the United Kingdom's 
leading asteroid-information centre. He 
works in smooth explanations of the vari- 
ous strategies for diverting an asteroid on 
a collision course with Earth. Footage from 
the many Russian car cameras that recorded 
the Chelyabinsk meteorite strike in Febru- 
ary 2013 fits the video-focused approach. 
Unlike in Disaster Playground, there is no 
wider discussion of what the end of the 
world means for anyone other than the self- 
obsessed Damon. 

Today, the cutting edge of asteroid science 
lies in pushing the limits of detection, 
searching for smaller and smaller risky space 
rocks. Scientists have pretty much nailed the 
chances of spotting any world-ending aster- 
oid, a rock big enough to wipe out an entire 
region. What is left are the ones that can slip 
through the current net of telescopes and hit 
with little to no warning. 

As New Year celebrations welcomed 
in 2014, an asteroid a few metres wide 
entered the atmosphere and disintegrated 
over the Atlantic Ocean — all while many 
of the astronomers who would normally 
track it were on a holiday break. We do live
in a disaster playground.


Alexandra Witze is a correspondent for 
Nature based in Boulder, Colorado. 
Russian science
loses to politics 


Philanthropist Dmitry Zimin
is closing down his successful
Dynasty Foundation, modern
Russia’s first private science-
funding organization, after
the Ministry of Justice fined it
for being a “foreign agent” (see
Nature 521, 273; 2015). Like 
many other Russian scientists, we 
believe that these events will have 
dire immediate and long-term 
consequences for Russia's science. 

Ironically, the government 
has been trying to revitalize 
Russian science in the past few 
years. Along with increased 
research funding to universities 
and a drastic overhaul of science 
management, it set up a mega- 
grant programme to attract 
back Russian scientists working 
abroad. It also created innovation 
centres and transferred fund 
management from the previously 
independent Russian Academy 
of Sciences to a government
organization (see go.nature.com/ 
m75bj4). 

The controversial law that 
claimed the Dynasty Foundation 
is intended to curtail perceived 
foreign influence in Russian 
politics. This case suggests that 
the future of Russian science 
depends on political forces 
to a greater extent than the 
government seems prepared to 
acknowledge. 

Fyodor A. Kondrashov 
University of Pompeu Fabra, 
Barcelona, Spain. 

Alexey S. Kondrashov 
University of Michigan, 

Ann Arbor, USA. 

Mikhail S. Gelfand Russian 
Academy of Sciences, Moscow, 
Russia. 
fyodor.kondrashov@crg.eu 


India: multi-author 
papers skew ranking 


Your ranking of India’s top
ten institutions, scored by the
number of research papers in the
Scopus citation database over the
past five years, is distorted by the
exceptionally large number of
citations attracted by papers with
hundreds of authors (see Nature
521, 142-143; 2015).

This distortion is often 
corrected by limiting citations 
to individual authors — for 
example, by using an individual 
h-index for reference. Another 
corrective measure would be to 
exclude papers with more than 
100 authors from citation metrics. 
For example, the Large Hadron 
Collider’s ATLAS collaboration 
includes thousands of authors, 
so exclusion would have roughly 
the same effect as using the 
individual h-index (J. E. Hirsch 
Proc. Natl Acad. Sci. USA 102, 
16569-16572; 2005). 

Exclusion would introduce 
major changes in some of your 
listed entries. For example,
I calculate that about 20% of the
publications attributed to the 
highly rated Panjab University 
have long author lists and 
contribute almost two-thirds 
of the citations. Excluding 
these papers reduces Panjab 
University’s citation impact 
ratio from 1.4 to 0.7, causing 
it to drop out of your top ten; 
the fall is comparable for your 
second-placed Tata Institute 
of Fundamental Research
in Mumbai. The fall for the
Indian Institute of Technology 
at Bombay and Guwahati is 
less marked at 15% and 12%, 
respectively, because these each 
generate proportionally fewer 
papers with long author lists. 
P. Sriram Indian Institute of 
Technology, Chennai, India. 
sriram@iitm.ac.in 


India: assess social 
impact of technology 


Researchers’ priorities for
improving science in India 
should include a commitment to 
assess the social impacts of new 
technologies in the Indian context 
(see Nature 521, 151-155; 2015). 
Big dams and atomic-energy 
programmes offered solutions 
to many of India’s problems after 
independence in 1947; the green
revolution and biotechnology
followed. Stem-cell therapy, 
nanotechnology, synthetic 
biology and pharmacogenomics 
are all now taking off. 

India sometimes seems 
prepared to overlook the potential 
societal consequences of such 
technologies in the name of 
development and progress. Its 
Land Acquisition Bill 2015, for 
example, seeks to exempt some 
important projects on defence, 
infrastructure and industrial 
regions from social assessment. 

The impact of new technologies 
on India’s sizeable poor and 
vulnerable population should be 
analysed before such innovations 
are introduced (see D. Greenbaum 
Nature Biotechnol. 33, 425-426; 
2015). Analysis would need to 
include investigation of their 
affordability and equitability 
(S.S. Tiwari and S. Raman New 
Genet. Soc. 33, 413-433; 2014). 
Shashank S. Tiwari ITT Labs, 
Birmingham, Alabama, USA. 
Ekta Tiwary University of 
Alabama at Birmingham, USA. 
shashank 17t@gmail.com 


Speed translation of 
misconduct reports 


Two reports of scientific 
misconduct were publicly released 
around 19 May this year, relating 
to work by graduate student 
Michael LaCour at the University 
of California, Los Angeles, and 
by surgeon Paolo Macchiarini at 
Stockholm’s Karolinska Institute 
(see Nature 521, 406-407; 2015). 
Surprisingly, the unofficial report 
on LaCour’s work spread like 
wildfire across the media, whereas 
the official report on Macchiarini’s 
received relatively little attention. 
This discrepancy could 
simply reflect popular interest 
in the different research areas 
investigated. But it is notable 
that the report on LaCour’s
work was in English (go.nature. 
com/5dyi6w), whereas that on 
Macchiarini’s was first released in 
Swedish (go.nature.com/bzwaxt). 
In my view, misconduct reports 
would be more useful if they were
simultaneously translated into
English, the common language
of scientific discourse. This
would allow the wider scientific
community to immediately assess
the validity of the claims and
to avoid new research based on
falsified findings.

Of the European non- 
English-speaking research 
integrity offices, only the Danish 
Committees on Scientific 
Dishonesty report in English. 
This hastens the all-important 
correction of the scientific record 
by enhancing accessibility and 
transparency. 

Chris H. J. Hartgerink Tilburg 
University, the Netherlands. 
c.h.j.hartgerink@uvt.nl 


US sanctions alarm 
physicians from Iran 


Iranian physicians in North 
America faced an unsettling 
constraint this month. Between 
29 May and 5 June, the US 
Educational Commission for 
Foreign Medical Graduates 
(ECFMG) stopped processing 
requests to verify credentials 
issued by Iranian institutions, 
pending clarification of 
restrictions on interactions 
between US and Iranian medical 
and educational organizations 
(see go.nature.com/jrcxzz). The 
Medical Council of Canada, 
which also uses the ECFMG, 
made a similar announcement. 

Iranian citizens are among the 
top ten groups of international 
physicians who acquired 
US certification in 2009 (see 
go.nature.com/ecmnkb) and 
were the largest group among 
immigrants entering post- 
medical training in Canada in 
the same year (see go.nature. 
com/95ayyf). 

As Richard Nephew — a former 
senior US official and leading 
sanctions architect — has warned, 
there is a risk that the US foreign- 
policy tool of using sanctions to 
isolate Iran could backfire (see 
go.nature.com/ocrini). 

Mehdi Aloosh Montreal, Canada. 
md_aloosh@hotmail.com 
OBITUARY


John Forbes Nash
(1928-2015)


Master of games and equations.


John Forbes Nash, an exalted mathema-
tician whose life took dramatic turns
between genius, mental illness and celeb-
rity status, made major contributions to game
theory, geometry and the field of partial
differential equations.

Nash, who died on 23 May, was born in
Bluefield, West Virginia, in 1928. His father
was an electrical engineer and his mother
a schoolteacher. In 1945, after excelling in
mathematics at high school, he attended
the Carnegie Institute of Technology (now
Carnegie Mellon University) in Pittsburgh,
Pennsylvania. At first he studied chemi-
cal engineering, but soon after enrolling he
switched to chemistry and then to maths.

In Nash’s final year, one of his profes-
sors wrote a recommendation letter for the
19-year-old supporting his application to
graduate school. It simply stated: “He is a
mathematical genius.” In 1948, Nash was
accepted by Harvard University in Cam-
bridge, Massachusetts, and by Princeton
University in New Jersey. He chose Princeton.

As a PhD student, Nash proved the exist-
ence of the equilibrium that now carries his
name. His 1950 paper ‘Equilibrium points in
n-person games’ contains about 330 words,
two references and not one equation (J. F. Nash
Jr Proc. Natl Acad. Sci. USA 36, 48-49; 1950).
One of the citations is the 1944 book Theory
of Games and Economic Behavior, in which
mathematician John von Neumann and econ-
omist Oskar Morgenstern introduce game
theory, a mathematical approach for studying
strategic and economic decisions.

The Nash equilibrium is a position in
a game from which none of the players
can change their strategy to improve their
pay-off. Imagine a game with two players
(yourself and another person) and two strat-
egies, A and B. If you both choose A, your
pay-off is 2. If you choose A and your oppo-
nent chooses B, you score 0. If you choose B
and the other player chooses A, your pay-off
is 3. If you both choose B, you score 1. The
same applies to your opponent.

In this example, the Nash equilibrium
occurs when both players choose B. If both
players choose B, their pay-off is 1; if either
player switches to A, their pay-off falls to 0.
In other words, neither player can indepen-
dently switch their strategy and improve their
pay-off. Observe that if both players select A,
there is no Nash equilibrium because you
could improve your pay-off by switching to B.

Calculating the Nash equilibrium can be
a formidable task in a complex game. There
is also the uncertainty over whether the 
person you are playing against is sufficiently 
rational to play the equilibrium strategy. If 
both players are rational and their rational- 
ity is common knowledge, they would play 
it. But experiments often reveal that people 
are not rational. Regardless of whether people 
actually play the Nash equilibrium in social or 
economic interactions, working out what it 
is (or what the Nash equilibria are) is the first 
step to analysing any game. 
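
The two-strategy game described earlier can be checked by brute force. The following is a minimal Python sketch, not part of the original obituary: it assumes the game is symmetric (as stated, the same pay-offs apply to the opponent), and the names PAYOFF, is_nash and pure_nash_equilibria are illustrative only. It tests every pair of pure strategies and keeps those from which neither player can gain by switching alone.

```python
from itertools import product

# Pay-off to a player, indexed by (own strategy, opponent's strategy),
# taken from the worked example in the text.
PAYOFF = {("A", "A"): 2, ("A", "B"): 0, ("B", "A"): 3, ("B", "B"): 1}
STRATEGIES = ("A", "B")


def is_nash(s1, s2):
    """Return True if neither player gains by unilaterally switching strategy."""
    p1_ok = all(PAYOFF[(s1, s2)] >= PAYOFF[(alt, s2)] for alt in STRATEGIES)
    # The game is assumed symmetric, so player 2 uses the same pay-off table.
    p2_ok = all(PAYOFF[(s2, s1)] >= PAYOFF[(alt, s1)] for alt in STRATEGIES)
    return p1_ok and p2_ok


def pure_nash_equilibria():
    """List every pure-strategy pair that is a Nash equilibrium."""
    return [pair for pair in product(STRATEGIES, repeat=2) if is_nash(*pair)]


print(pure_nash_equilibria())  # [('B', 'B')]
```

Only pure strategies are examined here; a game whose equilibria involve mixed strategies would need a different method.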

Although dismissed at the time by von 
Neumann as a triviality, the Nash equilib- 
rium has been used to analyse all sorts of 
competitive situations. As well as being key 
to decision-making in economics and poli- 
tics, the idea is important in biology. Here, 
the nearly equivalent concept, formulated by 
evolutionary biologist John Maynard Smith 
in the 1970s, is called an evolutionarily stable
strategy (ESS). If all members of a popula- 
tion adopt an ESS, then natural selection 
prevents a rare mutant from spreading. 

On completing his PhD, Nash joined the 
Massachusetts Institute of Technology (MIT) 
in Cambridge in 1951. He worked — first as 
an instructor and later as a professor — in the 
mathematics faculty until he resigned in 1959. 
It was while he was at MIT that Nash met and 
married Alicia Lopez-Harrison de Lardé, a 
physics student there. 

Among mathematicians, Nash is best known for his work in
real algebraic geometry and nonlinear partial differential equations.
He was not afraid to tackle the hardest 
problems in the field, and he succeeded. In 
1957, he — in parallel with Italian mathe- 
matician Ennio de Giorgi — solved Hilbert’s 
nineteenth problem involving partial differ- 
ential equations. 

It was during a talk in 1959 on what is seen
to be one of the hardest problems in maths — 
the Riemann hypothesis — that the audience 
realized that there was something wrong with 
Nash. His talk was incomprehensible. 

He was diagnosed with paranoid schizo- 
phrenia that year. Over the next two decades, 
Nash was in and out of hospitals. He under- 
went therapy, and for a while left the United 
States and sought asylum in Switzerland in 
an attempt to escape his imagined tormen- 
tors. For many years he wandered around the 
Princeton campus. Throughout this period, 
Alicia, who divorced Nash in 1963, oversaw 
much of his care. 

In the late 1980s, Nash reappeared in 
academic circles, and in 1994 he was awarded 
the Nobel Memorial Prize in Economic 
Sciences for his work on game theory. The 
Nobel and the 2001 film A Beautiful Mind, 
based on journalist Sylvia Nasar’s book of the 
same name, which recounted Nash's strug- 
gles, propelled him into the limelight. 

In May this year, Nash received the Abel 
prize from the Norwegian Academy of 
Science and Letters for his work on partial 
differential equations. On the way back from 
the celebration in Norway, John and Alicia 
(who had remarried in 2001) were killed in 
a car accident in a taxi on the New Jersey 
turnpike. John was 86. 

I met John in 1998 at the Institute for 
Advanced Study in Princeton. Over the years, 
I gave several talks there that he attended. One 
summer's day, when the usual sitting arrange- 
ments for lunch were disrupted by the closure 
of the main kitchen, I noticed John, the physi- 
cist Edward Witten and Andrew Wiles, the 
British mathematician who proved Fermat's 
last theorem, sitting down together at a small 
table. I wondered which of them would start 
the conversation. None of them did. I seem to 
remember that they ate their meal in silence.


Martin A. Nowak is professor of 
mathematics and biology, and director of 
the Program for Evolutionary Dynamics 
at Harvard University in Cambridge, 
Massachusetts, USA. 

e-mail: martin_nowak@harvard.edu 
ASTROPHYSICS


Dust-poor galaxies at early times


Observations of galaxies that formed early in the Universe’s history reveal much lower dust levels than are found in sources
from a slightly later era. It seems that galaxies underwent rapid change during a relatively short period. SEE LETTER P.455


VERONIQUE BUAT 


The study of the most distant galaxies,
observed as they were about 1 billion
years after the Big Bang, is crucial
for our understanding of the star-forming
activity and the physical processes at work
in these young systems. Writing in this issue
(page 455), Capak et al.¹ present a study of nine
such galaxies using a linked-up telescope array. 
They find that the dust and gas properties in 
these systems hint at an interstellar medium 
(ISM) that is much less evolved than in galax- 
ies about 2 billion years older. This suggests 
that there was a rapid change in the overall 
properties of galaxies during the early life of 
the Universe. 

The expansion of the Universe shifts the 
ultraviolet (UV) light emitted by newly formed 
stars in remote systems to longer (visible 
and near-infrared) wavelengths that, unlike 
UV light, can be observed by ground-based 
telescopes. The most distant objects known 
today are detected as a result of a break in the 
continuum of their redshifted spectra at wave- 
lengths of around 0.1 micrometres; this is due 
to the absorption of UV photons by neutral 
hydrogen in the intergalactic medium. The 
absorption occurs for photons with energies 
corresponding to wavelengths shorter than the 
Lyman-α line of hydrogen (121.6 nm), and galaxies
whose distances have been estimated by
this method are known as Lyman break galax- 
ies (LBGs). The most comprehensive surveys 
undertaken so far have led to detections of 
very young LBGs that formed approximately 
0.5 billion years after the Big Bang².
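
As a rough illustration of the redshifting described above, the observed wavelength of a spectral feature follows the standard relation λ_obs = λ_rest × (1 + z). The sketch below is not from the article: it assumes a Lyman-α rest wavelength of 0.1216 µm and uses redshifts of 5 and 6, the range quoted later in this article for the observed galaxies. The function name is illustrative.

```python
# Assumed rest wavelength of the Lyman-alpha line, in micrometres.
LYMAN_ALPHA_REST_UM = 0.1216


def observed_wavelength_um(rest_um, z):
    """Observed wavelength of a feature emitted at rest_um by a source at redshift z."""
    return rest_um * (1.0 + z)


for z in (5.0, 6.0):
    obs = observed_wavelength_um(LYMAN_ALPHA_REST_UM, z)
    print(f"z = {z:.0f}: Lyman break observed near {obs:.2f} micrometres")
```

Under these assumptions the break falls at roughly 0.73 to 0.85 µm, which is why such galaxies are sought at red and near-infrared wavelengths.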

The presence of interstellar dust compli- 
cates the study of galaxies, and affects meas- 
urements of fundamental, observationally 
derived properties such as the star-formation 
rate. This is because dust is efficient at absorb- 
ing the energetic UV photons (a proxy for the 
star-formation rate) that are emitted by young 
stars and at re-emitting their energy in the 
infrared domain, at wavelengths longer than 
5 µm. This is a complex process that depends
not only on the amount of dust present, but
also on its distribution relative to the stars and
on its composition³. Overall, dust substantially
reduces the intensity of stellar light reaching
the telescopes⁴.
Figure 1 | The Atacama Large Millimetre Array (ALMA). Capak et al.¹ used 20 of ALMA’s antennas
to study the interstellar medium (ISM) of 9 galaxies that were present when the Universe was only about
1 billion years old. The authors found that their sources contain a smaller amount of dust than expected.
Some of the galaxies in the sample may have an ISM similar to that of the Small Magellanic Cloud
(a satellite galaxy of the Milky Way), which is visible here (right of centre) as the smaller of the two
Magellanic Clouds above the antennas.


A straightforward method to account for the 
UV light produced in galaxies involves observ- 
ing the radiant energy that is absorbed by dust, 
is re-emitted and is then redshifted in the far- 
infrared and submillimetre domains. However, 
the low sensitivity of detectors, combined with 
the poor spatial resolution achieved by single- 
dish telescopes, make surveys of high-redshift 
galaxies at these wavelengths less efficient than 
those at optical or near-infrared wavelengths. 
Even the Herschel Space Observatory, which 
detected⁵ the infrared emission from dust in
galaxies at redshifts of up to 2-3 (correspond-
ing to a time roughly 2 billion to 3 billion
years after the Big Bang), was able to detect
only hyper-luminous sources at much larger
distances⁶.

Given that directly measuring the long- 
wavelength emission from dust is so challeng- 
ing, astronomers resort to empirical relations 
to derive dust’s infrared luminosity. One such
relation links this luminosity to the stellar
UV luminosity⁷ for a representative sample
of nearby, actively star-forming galaxies, for 
which both luminosities have been accurately 
measured. Unfortunately, this recipe is not uni- 
versally applicable because it depends on the 
properties of the ISM (such as the composition 
of dust and its distribution relative to the stars), 
as well as on the stellar populations in the galaxies⁸.
Despite these caveats, however, it is extensively
used to estimate the level of obscuration
of the stellar UV light by ISM dust for galaxies 
across a wide redshift range. It will therefore be 
important to check the validity of this relation- 
ship — especially for young, high-redshift sys- 
tems. A critical evaluation could also yield clues 
to the properties of the ISM at those early times. 

ESO/C. MALIN

Capak et al. used the Atacama Large Millimetre
Array (ALMA), which was designed to
overcome both the resolution and sensitivity
problems (Fig. 1). Being an interferometer
(a series of telescopes linked up to combine
astronomical observations), ALMA has a
small field of view that is suitable for observ-
ing well-centred sources, and it can detect the
weak submillimetre emission originating from
dust in ordinary galaxies at high redshifts. The
authors used 20 of ALMA’s antennas in unison
to observe the dust and gas emissions of
9 typical LBGs located at redshifts 5-6; these 
correspond to a time when the Universe was 
about 1 billion years old. 

Capak and colleagues selected their sample 
from the Cosmic Evolution Survey field, a two- 
square-degree area that has been extensively 
observed by most of the major telescopes, from 
the ground and from space. ALMA detected 
the thermal dust emission in four galaxies, 
and an ISM spectral line emitted from gaseous 
carbon at a wavelength of 158 um in all nine of 
them; the carbon feature is the dominant ISM 
emission line of galaxies in the far-infrared 
domain. Such a high detection rate is out- 
standing, because previous attempts failed to 
simultaneously detect the carbon feature and 
thermal dust emission⁹.


NEURODEGENERATION 


The authors’ study argues for a very low dust 
content and stellar-light obscuration in these 
systems. The four galaxies whose thermal dust 
emission was detected may harbour an ISM 
similar to that of the Small Magellanic Cloud 
(a satellite galaxy of the Milky Way), which is 
characterized by a low abundance of elements 
heavier than helium. The upper limits put on 
the dust emission of the remaining five sources 
call for an even more extreme situation with 
a much lower infrared emission. That seems 
to be at odds with the observed UV luminos- 
ity of these systems. The enhanced carbon 
emission-line intensities also suggest low dust 
levels relative to the gas present in these early 
galaxies, although other explanations cannot 
be excluded. 

An immediate consequence of these 
findings is that the classical calculations 
used to derive obscuration due to dust from 
the observed UV continuum luminosity 
are unlikely to be valid for LBGs in the early 
Universe. As a result, the star-formation 
rate considered to be appropriate for these 
galaxy types is likely to have been over- 
estimated by factors of between two and four 
in previous studies. Last but not least, this 


Evolved protection 
against human prions 


A genetic variant of PrP, the protein that forms prions, confers protection against 
the human prion disease kuru by inhibiting the conversion of functional isoforms 
to the abnormal, disease-causing conformation. SEE LETTER P.478 


GLENN TELLING 


he group of neurodegenerative 
disorders known as transmissible 
spongiform encephalopathies are 
incurable and lethal. They are caused by 
prions, a type of infectious agent that has 
challenged our understanding of the concepts 
of infection and inheritance. In prion diseases, 
an abnormal isoform of prion protein (PrP), 
dubbed PrP*S induces conformational change 
in the functional isoform PrP“. The functional 
form adopts a disease-causing structure, and 
these abnormal proteins propagate throughout 
the brain, damaging nerves. But the mecha- 
nistic underpinnings of this infectious propa- 
gation remain enigmatic. On page 478 of this 
issue, Asante et al.' describe a form of PrP that 
seems to be resistant to such conformational 
change, providing some insight into the long- 
standing issue of whether it is possible to pre- 
vent prion diseases. 
Kuru, a human prion disease, became a 

Figure 1 | Valine variants. The wild-type
amino-acid sequence of human prion-protein
residues 126 to 130 is glycine (G), G, tyrosine (Y),
methionine (M) and leucine (L). Replacement of G
with valine (V) at residue 127 is known to confer protection
against the prion disease kuru. Asante et al.¹
demonstrate that, in mice, one copy of V127 also
confers resistance to sporadic Creutzfeldt-Jakob
disease (sCJD), and two copies protect against all
human prion diseases.

pioneering work paves the way for future
observational campaigns. Although observ- 
ing the low levels of dust emission from large 
samples of high-redshift galaxies may prove 
challenging even for ALMA, Capak and 
co-workers’ finding of enhanced carbon 
emission lines should become a useful tool in 
the study of star-forming galaxies at those 
early epochs.
France.

e-mail: veronique.buat@lam.fr 


1. Capak, P.L. et al. Nature 522, 455-458 (2015). 
2. Bouwens, R. J. et al. Astrophys. J. 803, 34 (2015). 
3. Witt, A. N. & Gordon, K. D. Astrophys. J. 528, 799 
(2000). 
4. Burgarella, D. et al. Astron. Astrophys. 554, A70 
(2013). 
5. Gruppioni, C. et al. Mon. Not. R. Astron. Soc. 436, 
2875-2876 (2013). 
6. Riechers, D. A. et al. Nature 496, 329-333 (2013). 
7. Meurer, G. R., Heckman, T. M. & Calzetti, D. 
Astrophys. J. 521, 64 (1999). 
8. Boquien, M. et al. Astron. Astrophys. 539, A145 
(2012). 
9. Maiolino, R. et al. Mon. Not. R. Astron. Soc.
(in the press); Preprint at http://arxiv.org/ 
abs/1502.06634v2 


leading cause of death among the Fore 
population of the highlands of Papua New 
Guinea in the mid-twentieth century²,
killing more than 3,000 people in a population
of 30,000. The aetiology of the disease was initially
puzzling, but in 1966, a study³ showed
that brain extracts from people who had kuru 
caused progressive neurodegeneration when 
introduced into chimpanzees, following a pro- 
longed incubation period. This supported the 
idea that kuru is an infectious disorder. In fact, 
kuru is a devastating consequence of ritualistic 
cannibalism, in which the brains and other 
body parts of tribal members were consumed 
as an act of mourning. 

Fast forward to the end of the twentieth 
century, and interest in the kuru epidemic 
was renewed in response to the emergence of 
variant Creutzfeldt-Jakob disease (vCJD). This
human prion disease results from exposure to 
prions that cause bovine spongiform enceph- 
alopathy, carried by cows with ‘mad cow’ 
disease. Then, in 2009, the authors of the 
current study discovered a genetic variant 
(a polymorphism) in PRNP, the gene that 
encodes PrP (ref. 4). The variant, which 
changes the amino acid at residue 127 from 
glycine (G) to valine (V), was present in one of 
the two copies of PRNP (mammals have two 
copies of each chromosome) in unaffected 
individuals in the kuru-exposed population. 
The authors proposed that the variant, called 
V127, conferred resistance to prion disease, 
and that it had been selected for in response to 
the kuru epidemic. 

Asante et al. set out to compare V127 with 
another polymorphism in close proximity:
amino-acid residue 129 of PrP, which can
be either methionine (M) or V. It is thought 
that one copy of each 129 variant confers some 
resistance to human prion diseases by affecting 
the binding between identical prion proteins 
that is required for conformational change. 
The researchers injected human prions into 
mice engineered to express human forms of 
PrP harbouring different combinations of the 
two genetic variants. 

Mice with one copy of V127 were completely 
protected against kuru prions, and died of old 
age without developing neurological disease. 
By contrast, all mice without V127 developed 
disease after around 200 days, regardless of 
the variant at residue 129, confirming that 
this model system accurately recapitulates the 
resistance to kuru conferred by V127. Previ- 
ous work’ has indicated that the transmission 
properties of kuru are similar to those of the 
sporadic form of CJD (sCJD), and so it prob- 
ably came as no surprise to the authors to find 
that mice with one copy of V127 were also 
protected from sCJD prions. Interestingly, 
however, not all of these mice were protected 
against vCJD prions. 

Remarkably, mice carrying two copies of 
V127 were completely protected against all
forms of human prion disease (Fig. 1), indi- 
cating that the polymorphism at residue 127 
acts in a different manner from that at 129. 
These mice failed to manifest any signs of 
prion disease, a noteworthy response that was 
mimicked only in mice that did not express 
any prion protein. Finally, Asante et al. inves- 
tigated how the protective effect of one copy 
of V127 varied when wild-type PrP (in which 
the genetic sequence includes the G127 and 
M129 variants) was expressed at different 
levels. These experiments demonstrated that 
V127 acts as a ‘dominant negative’ inhibitor of
prion conversion: not only is it itself resistant
to conformational conversion, but it also
inhibits conversion of wild-type proteins.

It will be important to learn what structural 
consequences, if any, occur when V replaces 
G at residue 127. There are currently no treatments
or cures for transmissible spongiform
encephalopathies, and structural information
may hint at ways to inhibit PrPSc
propagation and maintain normal PrPC
function. Approaches that capitalize on
the profound dominant-negative proper- 
ties of V127 are obvious targets for further 
investigation. 

V127 was naturally selected in response to
epidemic human prion disease. In a similar 
manner to DNA- or RNA-based infectious 
agents, prions are subject to Darwinian forces, 
such that the structure of the disease-causing 
protein can be altered through selective
pressure’. This raises the question of whether new
prion strains might emerge to counter the 
protective effects of the V127 variant. It will 
be interesting to determine whether V127 
also prevents conformational changes in PrP 
in experimental models of prion amplification,
which have been shown to enable interspe- 
cies prion transmission’, and if so, what the 
infectious properties of the resulting prions 
might be. Finally, it will be valuable to inves- 
tigate the V127 polymorphism in the context 
of PrP from other species that are naturally 
susceptible to prion diseases, using similar 
approaches to those in the current study. This 
should help to ascertain whether the effects of 
this fascinating variant are universally protec- 
tive against all combinations of species and 
prion strains.


Precision positioning
with peptides 


Two related peptides compete for binding to the same receptor to regulate the 
spacing of cells on the lower surfaces of leaves. This discovery highlights the 
complexity of cell signalling in plants. SEE ARTICLE P.439 


SACCO DE VRIES 


Being fixed in position, plants must be
able to respond to a multitude of environmental
and developmental inputs
over appropriate time frames. To do this, they 
use organic molecules such as auxin or ethyl- 
ene, as well as signalling molecules analogous 
to those used in animals, including peptides 
(short strings of amino acids) and hormones. 
But plant peptide signalling pathways, particu- 
larly interactions between peptides and their 
receptor proteins, are poorly understood. In 



this issue, Lee et al.¹ (page 439) define a plant
signalling pathway in which two peptides 
compete for binding to one receptor, exerting 
opposing effects on the development of tiny 
openings called stomata on the lower surfaces 
of leaves. 

Stomata regulate the passage of oxygen, 
carbon dioxide and water vapour in and out 
of the leaf. These pores are surrounded by two 
guard cells, which mediate stomatal opening 
and closing in response to the time of day, to 
changes in the environment or to physical 
changes in the plant itself. Guard cells originate 




Figure 1 | Orchestrating stomatal development. During leaf development, most protodermal cells
develop into pavement cells, which line the underside of leaves. But a few can undergo differentiation
and asymmetric division to form meristemoid cells and stomatal lineage ground cells (SLGCs), which
together give rise to the guard cells that enclose stomata, the pores on the surface of the leaf. Meristemoids,
SLGCs and protodermal cells all express the receptor protein ERECTA (ER). ER activation by the
peptide EPF2, which is secreted from cells that have already become meristemoids, suppresses the
meristemoid-guard-cell lineage in adjacent cells. By contrast, the peptide Stomagen, which is produced in
the overlying mesophyll cell layer, inhibits activation of ER and so promotes guard-cell differentiation. Lee
et al.¹ report that EPF2 and Stomagen compete for binding to ER, to regulate the even spacing of guard cells
between pavement cells.

from precursors called protodermal cells.
Protodermal cells can either proliferate to 
give rise to pavement cells, which make up 
the protective epidermis covering the surface 
of the leaf, or differentiate into a meristemoid 
mother cell (MMC), a progenitor of the guard 
cell. MMCs divide asymmetrically, one daugh- 
ter cell becoming a meristemoid, the other 
becoming a stomatal-lineage ground cell, a 
progenitor that can either differentiate into 
pavement cells or, through a ‘spacing division’,
give rise to a second, satellite meristemoid 
cell. Meristemoids then undergo asymmetric 
divisions, and finally differentiate into pairs of 
guard cells that are evenly distributed between 
pavement cells (Fig. 1). But what governs these 
cell-fate decisions? 

Mutations in genes encoding receptor 
proteins of the ERECTA (ER) family cause 
a loss of control over stomatal spacing, lead- 
ing to the development of leaves with une- 
venly spaced stomata. The severity of this 
trait is further influenced by mutations in 
an ER co-receptor, TOO MANY MOUTHS 
(TMM), demonstrating that ER-family sig- 
nalling is a key regulator of stomatal devel- 
opment. ER-family receptors are activated 
by EPIDERMAL PATTERNING FACTOR 1 
(EPF1) and EPF2, peptides that exert dif- 
ferent, but overlapping, effects on stoma- 
tal development. By signalling through the 
receptor ER-LIKE 1 (ERL1), EPF1 orients 
stomatal spacing and prevents differentiation 
into guard cells, whereas EPF2-ER signalling 
restricts the formation of this lineage at an 
earlier stage. Both receptors exert their effects 
by activating an intracellular signalling cascade 
called the MAPK pathway (reviewed in ref. 2). 

In contrast to EPF1 and EPF2, the EPF-like 
peptide Stomagen promotes stomatal devel- 
opment by inducing guard-cell differentia- 
tion’. Lee et al. set out to investigate how these 
related peptides could exert such dramatically 
opposing effects. The authors found that the 
altered stomatal differentiation that occurs in 
plants harbouring mutant forms of the genes 
encoding ER, ERL1 or TMM was not affected 
by the addition of Stomagen. This suggests that 
Stomagen, like EPF1 and EPF2, acts through 
ER-family receptors. 

Lee and colleagues then provided extensive 
genetic evidence to support this idea. By 
inhibiting or overexpressing the gene encod- 
ing Stomagen, the authors showed that mis- 
expression of the peptide interferes with 
all signalling pathways mediated by the ER 
family. Next, using sophisticated genetic 
and in vitro biochemical experiments, they 
confirmed that Stomagen actively competes 
with EPF2 for ER, the two peptides binding 
to the receptor with similar affinities. Finally, 
the researchers showed that Stomagen could 
not activate MAPK signalling. They pro- 
pose that binding of Stomagen to ER-family 
receptors actually prevents MAPK signal 
transduction. 


These results provide evidence that a finely 
balanced system of closely related activating 
and inhibitory peptides locally modulates the 
signalling pathways required for proper sto- 
matal spacing in the plant epidermis (Fig. 1). 
One uncertainty in this model is the similar- 
ity in the dissociation constants (a measure of 
the binding strength between two proteins) 
of EPF2-ER and Stomagen-ER. This implies 
that massive changes in the local concentra- 
tion of one ligand, or a large adjustment in the 
ligand’s local receptor sensitivity, would be 
needed to induce a change in stomatal pattern- 
ing. Which mechanism is used in physiologi- 
cal conditions is an issue that clearly needs to 
be addressed. 
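
A rough way to see the size of the effect is the textbook equilibrium model for two ligands competing for one binding site. The sketch below is illustrative only and is not taken from Lee et al.: the function name, the dissociation constant and the concentrations are hypothetical, and real receptor signalling need not follow this simple one-site model.

```python
def receptor_occupancy(conc_a, kd_a, conc_b, kd_b):
    """Fraction of a single-site receptor bound by ligand A and by ligand B
    at equilibrium, under simple competitive binding (an assumed model)."""
    a = conc_a / kd_a          # concentration of A scaled by its dissociation constant
    b = conc_b / kd_b          # concentration of B scaled by its dissociation constant
    denom = 1.0 + a + b        # free receptor + A-bound + B-bound
    return a / denom, b / denom


# Hypothetical numbers: both peptides share the same dissociation constant.
KD = 10.0      # arbitrary concentration units
EPF2 = 10.0    # held fixed

for stomagen in (10.0, 100.0, 1000.0):
    epf2_bound, stomagen_bound = receptor_occupancy(EPF2, KD, stomagen, KD)
    print(f"Stomagen={stomagen:7.1f}  EPF2-bound={epf2_bound:.3f}  "
          f"Stomagen-bound={stomagen_bound:.3f}")
```

When the two dissociation constants are equal, the ratio of the two occupancies in this model is simply the ratio of the two concentrations, so shifting the balance at the receptor requires a comparable shift in local ligand levels.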

A competitive inhibition mechanism 
provides precise control of cellular spacing, 
analogous to the ‘lateral inhibition’ mechanism 
that regulates the spacing of animal cells. It is 
now imperative to determine whether similar 
mechanisms operate over time to generate 
repetitive differentiation events periodically 
during plant development. For example, 
systems such as the CLE40-CLV1-ACR4 
signalling pathway, which regulates stem- 
cell maintenance in the growing tips and 
roots of plants’, might use a similar combi- 
nation of activators and inhibitors. Future 
studies should also analyse whether local 
changes in the levels of particular peptide 
combinations could weave a 3D pattern of
locally activating and inhibitory signalling 
conditions. 

The results of the current study can be put 
in the larger context of plant signalling path- 
ways — which includes different combina- 
tions of ligands, receptors and co-receptors 
— acting in conjunction with local changes 
in signalling pathways mediated by organic 
molecules. Furthermore, because there is evi- 
dence for a direct connection between plant 
steroid signalling and the stomatal pathway’, 
yet another level of signalling seems plau- 
sible. A picture emerges in which almost 
every cell in a plant tissue can be exposed to a 
precisely defined set of signals. This new con- 
cept in plant spatial control should now be 
tested in other settings.


The dynamics of
temperature extremes 


Changes in the occurrence of atmospheric circulation patterns are not well 
understood. A study finds that these have been a big factor in observed changes 
in regional temperature extremes during recent decades. SEE LETTER P.465 


THEODORE G. SHEPHERD 


The planet is unequivocally warming¹,
but the rate of warming varies in time
and space. Although such variations
are not unexpected, and do not challenge 
our grasp of the physics of global warming, 
understanding what drives them is crucial for 
predicting climate conditions at the regional 
scale. On page 465 of this issue, Horton et al.²
use a new analytical method to argue that, 
over many regions and seasons, changes in 
the occurrence of atmospheric circulation pat- 
terns were a substantial driver of the observed 
changes in mid-latitude temperature extremes 
in the Northern Hemisphere over land for the 
35 years to 2013. Such changes have increased 
not only the likelihood of summertime heat- 
waves in Europe, but also, since 1990, the 
occurrence of low wintertime temperature
extremes across central Asia’, 

Identifying changes in climate extremes at 
the regional scale is hampered by the chaotic 
internal variability of the climate system, which 
adds a large degree of noise to the climate- 
change signal. Climate variability is manifested 
in changes in the occurrence of atmospheric 
circulation patterns, which can vary strongly 
from year to year, but can show trends over 
multidecadal periods too. Such variations are 
also reflected in many climate extremes*. For 
example, European heatwaves (Fig. 1) are more 
frequent when the summertime jet stream is 
‘blocked’ from taking its usual path across 
the North Atlantic Ocean, because stagnant 
air over the continent can warm excessively 
through land-surface feedbacks’. 

On top of this internal variability, climate 

Figure 1 | The River Loire in France during the 2003 European heatwave. Horton et al.² report that this heatwave was part of a long-term trend towards more anticyclonic summertime circulation regimes over Europe.


change itself is expected to lead to changes in 
the occurrence of atmospheric circulation pat- 
terns, although there is as yet little consensus 
on what these changes will be’. Both internal 
variability and the circulation response to cli- 
mate change can lead to regional changes in 
temperature extremes that are larger or smaller 
than — or even opposite to — those expected 
from global warming over extended time 
periods’. 

To deal with the confounding effects of 
atmospheric circulation on changes in tem- 
perature extremes, Horton et al. applied a 
cluster analysis to geopotential height fields 
— which map the heights needed to reach a 
given pressure, taking into account variations 
in surface pressure and atmospheric density — 
at an altitude of about 5 kilometres, to identify 
robust changes in the occurrence of circulation 
regimes. Their approach restricts the analysis 
to the years after 1979, when observations 
from meteorological satellites improved the 
reliability of analysed changes in atmospheric 
circulation at mid-latitudes. But it avoids pre- 
viously voiced concerns about using subjec- 
tive measures of atmospheric circulation’. The 
authors then partitioned observed changes in 
temperature extremes into those associated 
with changes in the occurrence of circulation 
regimes (dynamic changes) and those that 
have no such association (thermodynamic 
changes). This kind of partitioning is a grow- 
ing theme in climate science’. 
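
One simple way to express such a partition, assuming the mean of a temperature metric can be written as a sum over circulation regimes of occurrence frequency times regime-mean value, is sketched below. This is an illustrative decomposition and may differ from the exact procedure used by Horton et al.; the function name and all numbers are hypothetical.

```python
def partition_change(freq_early, value_early, freq_late, value_late):
    """Split the change in sum(freq * value) between two periods into
    dynamic, thermodynamic and interaction terms (an assumed decomposition)."""
    regimes = list(zip(freq_early, value_early, freq_late, value_late))
    # Dynamic: regimes occur more or less often, with their character unchanged.
    dynamic = sum((fl - fe) * ve for fe, ve, fl, vl in regimes)
    # Thermodynamic: regimes occur equally often but are warmer or colder.
    thermodynamic = sum(fe * (vl - ve) for fe, ve, fl, vl in regimes)
    # Interaction: both frequency and character change together.
    interaction = sum((fl - fe) * (vl - ve) for fe, ve, fl, vl in regimes)
    total = (sum(fl * vl for fe, ve, fl, vl in regimes)
             - sum(fe * ve for fe, ve, fl, vl in regimes))
    return dynamic, thermodynamic, interaction, total


# Hypothetical example with three circulation regimes.
freq_early = [0.5, 0.3, 0.2]    # fraction of days in each regime, early period
value_early = [0.0, 1.0, 3.0]   # regime-mean temperature anomaly, early period
freq_late = [0.4, 0.3, 0.3]     # the warm third regime becomes more frequent
value_late = [0.5, 1.5, 3.5]    # every regime warms by the same amount

dyn, thermo, inter, total = partition_change(
    freq_early, value_early, freq_late, value_late)
print(f"dynamic={dyn:.2f} thermodynamic={thermo:.2f} "
      f"interaction={inter:.2f} total={total:.2f}")
```

The three terms add up to the total change exactly, which is what makes this kind of bookkeeping convenient: the dynamic share can be read off as a fraction of the total.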

Horton and colleagues find that most 
regions have experienced a clear increase in 
summertime high temperature extremes since 
1979. For roughly half of those regions (eastern 
North America, Europe and western Asia),
the researchers identify a substantial contri- 
bution to the observed trends (one-third to 
one-half) from the increased occurrence of 
anticyclonic circulations, which are associated 
with atmospheric blocking. The implication is 
that the tendency for increased summertime 
heatwaves that can be expected from global 
warming has been substantially enhanced 
in these regions over the past 35 years by 
circulation changes that make heatwaves 
more likely. 

Although global warming suggests that, 
in general, the likelihood of low temperature 
extremes should decrease, Horton et al. find
an increase in wintertime low temperature 
extremes over central Asia since 1990 — the 
period over which the extent of Arctic sea 
ice has rapidly declined. They attribute this 
increase primarily to the increased occur- 
rence of a circulation pattern associated with 
the transport of cold Arctic air into central 
Asia. Modelling studies’ have suggested that 
such an altered circulation is attributable to 
declining sea-ice extent in the Barents and 
Kara seas. However, the authors find no evi- 
dence of increased wintertime low tempera- 
ture extremes over North America, despite the 
attention given to claims of such an increase’. 

One concern with this study is that it is 
based on 35-year (and, in some cases, only 
24-year) time series, which are quite short by 
climate standards. Horton and co-workers 
performed an extensive analysis to determine 
the statistical significance of the trends, but 
this establishes only that discernible changes 
occurred over the periods in question. As the 
authors note, the identified circulation trends
may merely reflect multidecadal natural 
variability and cannot be associated defini- 
tively with climate change. This is clearly an 
important issue to resolve. Other climate 
extremes, such as droughts and flooding, 
may be even more susceptible to circulation 
changes than are temperature extremes. 

A better observational record of past 
changes in atmospheric circulation, over 
the previous century and possibly over even 
longer timescales, would help to put the recent 
changes in context. Such records might be 
acquired by analysing data from ships’ logbooks
or by developing climate proxies, for
example. Improved prediction capabilities also 
require a better understanding of the mecha- 
nisms of climate variability. Finally, a better 
understanding of the physics of the expected 
atmospheric circulation response to climate 
change would provide testable hypotheses. In 
particular, can the most extreme predictions of 
circulation changes be constrained or excluded 
by the observational record? Unless progress 
is made on these fronts, we will continue to 
be surprised by deviations from the thermo- 
dynamic expectations of global warming.


Unexpected role
for vitamin B2 


An enzyme has been found that alters the molecular structure of vitamin B2, adding
a fourth ring to its existing three-ring system. The product catalyses new types of 
chemistry in concert with certain other enzymes. SEE LETTERS P.497 & P.502 


CATHERINE F. CLARKE 
& CHRISTOPHER M. ALLAN 


Many vitamins work by expanding
the biochemical repertoire of
enzyme-catalysed reactions. For
example, riboflavin (vitamin B2) does this for 
a wide variety of flavoproteins, which cata- 
lyse many of the redox reactions in central 
metabolic pathways and act in the electron- 
transport chains of cells. Two papers in this 
issue, by White et al.¹ (page 502) and Payne
et al.² (page 497), report a previously unknown
cofactor derived from riboflavin. The findings 
solve the long-standing mystery of how a pair 
of bacterial enzymes — and their counterparts 
in yeast — catalyse crucial reactions known 
as decarboxylations, and further expand the 
repertoire of enzymatic reactions. 

White et al. relate that the molecular struc- 
ture of riboflavin, which contains three rings, 
can be modified by the addition of a prenyl 
group (a hydrocarbon group containing five 




carbon atoms, also known as an isoprenyl 
group) to form a fourth ring (Fig. 1). The 
authors call the resulting compound prenylated
flavin mononucleotide (prenylated
FMNH2). They used high-resolution crystal
structures, biochemical studies, spectroscopy
and computational calculations to characterize
prenylated FMNH2 and to determine how
it is employed as a cofactor in decarboxylation
reactions, in which a carboxylate group
(CO2−) is removed in the form of carbon
dioxide from a substrate.

Surprisingly, the enzyme that forms
prenylated FMNH2 is UbiX, a protein first
found in the bacterium Escherichia coli. The 
authors show that UbiX works in tandem 
with another enzyme, called UbiD, to mediate 
the decarboxylation of an intermediate in the 
biosynthesis of coenzyme Q (see Fig. 1a of the 
paper’). Coenzyme Q is an essential lipid in the 
electron-transport chain and a potent cellular 
antioxidant’. 

How exactly UbiX and UbiD cooperate 


50 Years Ago

As an introduction, I should like
to touch on something which is
not a unique feature of the subject
of discourse. However, this is as
good an excuse as any to look at the
disturbing fact that communications
between scientist and scientist
are in a state of overgrowth or
‘overpublication’, while in contrast
to this the writings ... addressed
to the non-expert are, so some
believe, somewhat neglected or
‘underpublished’. Of course, one does
not condemn the vastly increased
potentialities (5-10 times that of
1945) for scientific printing space,
but the sometimes indiscriminate
multiplication of the printed words,
formulae and illustrations leads
to overburdening of libraries, to
a nightmarish frustration of the
research worker who never catches
up with his reading, and willy-nilly to
an encouragement of unfinished and
mediocre material to be published.
From Nature 26 June 1965


100 Years Ago 


All good nomenclature should
be unambiguous, and, if possible,
self-explanatory. The terms masse
volumique, volume massique, and
stéradian have both these desirable
qualities; no one with a knowledge
of physics and French could make
any mistake as to the exact meaning
of the first two, and the meaning of
the third should be at once self-evident
to anyone who knows the
definition of solid angle. I should
not expect a chemist or a botanist to
have anything but a hazy idea of the
meaning of puissance massique, but
even to an ordinary French engineer
it should convey its meaning
instantly. An expression of this kind,
far from being an “eccentricity”, is
a triumph of nomenclature. It is


Figure 1 | Prenylated riboflavin. Flavin mononucleotide (FMNH2) is a common enzyme cofactor that
is derived from riboflavin (vitamin B2). White et al.¹ show that the bacterial UbiX enzyme and the fungal
Pad1 enzyme modify FMNH2 in the presence of dimethylallyl monophosphate (DMAP), a precursor of
prenyl groups, to generate a previously unknown cofactor, prenylated FMNH2. Payne et al.² show that
prenylated FMNH2 is an essential cofactor for the decarboxylase enzymes UbiD and Fdc1; apo-forms of
UbiD and Fdc1 lack a bound cofactor and so are inactive. Me, methyl group.

possible to mould language by logic;
it is the only way to mould language 
that shall be truly scientific. 
From Nature 24 June 1915 
to perform the decarboxylation reaction has
been quite puzzling. Both enzymes are pre- 
sent in a wide array of prokaryotes (bacterial 
and archaeal microorganisms), and many 
other microbes, including fungi, rely on 
homologous enzymes — proteins that share 
ancestry with UbiD and UbiX — to decar- 
boxylate various acids. White et al. show that 
prenylated FMNH2 is first oxidized by Fdc1,
a fungal homologue of UbiD, and then used 
to decarboxylate cinnamic acid (see Fig. 1a of 
the paper’). 

So how does prenylated FMNH2 form?
White et al. reveal that when UbiX is supplied 
with dimethylallyl monophosphate (a precur- 
sor of prenyl groups), it prenylates riboflavin’s 
three-ring system. This results in the formation 
of a fourth ring to generate prenylated FMNH2
(Fig. 1). The authors show that this previously 
unknown cofactor enables Fdc1 to decarboxylate
substrates in vitro, in the absence of UbiX.
These results explain the findings of other 
studies** that showed that an unknown small 
molecule — but not UbiX itself — is required 
to activate the ability of UbiD (and Fdc1) to 
decarboxylate substrates. 

Payne et al.² show that apo-Fdc1 (an apoenzyme
is one without its bound cofactor)
catalyses unusual biochemistry only when in
the presence of prenylated FMNH2. They find
that Fdc1 first oxidizes prenylated FMNH2 to
generate an ‘iminium’ form of the cofactor. 
This activates and prepares Fdc1 to decarbox- 
ylate a wide assortment of substrates known 
as α,β-unsaturated aromatic carboxylic acids.
Many of these derive from the microbial break- 
down of lignin, the structural component of 
the secondary cell walls of plants®. The authors 
propose a mechanism for these decarboxyla- 
tion reactions known as 1,3-dipolar cyclo- 
addition (see Fig. 4d of the paper’). Although 
this mechanism is well known to organic 
chemists, its use by enzymes has until now 
been speculative’. 

The findings raise several questions about 
the biological use of prenylated riboflavin 
cofactors. First, there is no obvious amino-acid
sequence for binding prenylated FMNH2:
UbiX and UbiD (or Fdc1) do not share similar 
amino-acid sequences or structures, and the 
interactions of these proteins with the prenyl 
group seem to be restricted to providing an 
appropriately shaped binding site. This com- 
plicates the identification of other enzymes 
that might make use of this cofactor. 

It is also surprising that UbiX uses dimethylallyl
monophosphate as a source of prenyl
groups, rather than the commonly used 
source dimethylallyl diphosphate. It will be 
important to determine the metabolic origin of 
dimethylallyl monophosphate and to explore 
whether it is used in other enzyme reactions. 
Dimethylallyl diphosphate is generated either 
through a series of biochemical reactions 
known as the mevalonate pathway, or through 
the methylerythritol phosphate pathway. In 
both pathways, dimethylallyl diphosphate is
reversibly converted to another compound, 
isopentenyl diphosphate. A third, related 
pathway has recently been characterized* in 
the archaeon Thermoplasma acidophilum. 
This alternative route directly produces iso- 
pentenyl monophosphate — could it be that 
this compound interconverts with dimethy- 
lallyl monophosphate, in the same way that 
dimethylallyl diphosphate and isopentenyl 
diphosphate interconvert? 

Can Pad1 (the fungal homologue of UbiX) 
and Fdc1 also mediate decarboxylation steps
in the biosynthesis of coenzyme Q? UbiX from 
E. coli and Pad1 from yeast have been shown 
to perform the same function: a mutant form 
of E. coli in which the ubiX gene is deleted can- 
not synthesize coenzyme Q, but the synthesis is 
restored’ if the mutant is engineered to express 
yeast Pad1. The two current papers now iden- 
tify both Pad1 and UbiX as FMN prenyltrans- 
ferases, enzymes that synthesize prenylated 
FMNH, as a diffusible small molecule. 

But what about Fdc1 and UbiD? Do these
enzymes both recognize 3-polyprenyl-4- 
hydroxybenzoic acid (the substrate that 
is decarboxylated in the biosynthesis of 
coenzyme Q)? Unfortunately, 3-polyprenyl- 
4-hydroxybenzoic acid is not commercially 
available, and so the authors of the current 
papers could not perform direct assays of Fdc1
or UbiD with this substrate. But the synthesis
of coenzyme Q is not impaired when the genes 
that express Pad1 and Fdc1 are both deleted
from two species of yeast””®. The enzyme 
responsible for the decarboxylation step of 
coenzyme Q biosynthesis in eukaryotes (organ- 
isms that include fungi, plants and animals) 
therefore remains an open question.


Sugar for sight


Retinitis pigmentosa causes the death of cone cells, leading to blindness. A factor 
secreted from rod cells, RdCVF, promotes cone survival in a mouse model of the
disease. It now emerges that RdCVF works by increasing glucose uptake in cones. 


CONNIE CEPKO & CLAUDIO PUNZO 


Photoreceptor cells called cones endow us with rich,
high-acuity colour vision. By contrast,
their night-time counterparts, rod cells, are so
light-sensitive that they enable us to see when
walking in the woods under starlight. Sadly, a
mutation in any one of more than 200 genes
can cause diseases such as retinitis pigmentosa
that bring about photoreceptor degeneration
and lead to blindness. Many of these disease-causing
genes are rod-specific, affecting only
night vision, but when rods die, the malfunction
and death of cones soon follow. Writing
in Cell, Ait-Ali et al. provide evidence for an
interaction between these two cell types that
might explain why cones depend on rods for
their survival, and that might eventually lead
to a therapy for such diseases.

Retinitis pigmentosa affects 1 in 3,000 people,
and effective treatments are sorely needed. The
large number of disease-causing mutations
means that these treatments should be gene-independent,
focusing instead on targeting
the biological pathways that cause the cones'
death. The reasons for their death are unclear.
However, there is evidence that several factors
can contribute to cone death in retinitis pigmentosa.
These include hyperoxia (an excess
of oxygen), which causes oxidative damage by
increasing the levels of free radicals; a lack
of energy; and a lack of intermediates in the
anabolic processes by which large molecules
are constructed from smaller ones.

In agreement with a role for such factors,
hyperactivation of the protein complex
mTORC1, which controls cell metabolism by
balancing demand with supply, increases cone
survival. This protein complex probably acts
by promoting the expression of genes that
improve glucose uptake and use, raising levels
of anabolic intermediates and of an anabolic
cofactor molecule called NADPH. Adequate
levels of NADPH are likely to be crucial to
cone survival because, in addition to its role in
anabolic processes, it is needed for pathways
that detoxify free radicals in hyperoxic retinas.
Injection of antioxidants or viral-vector delivery
of genes that fight oxidation prolongs cone
survival in mouse models of retinitis pigmentosa,
supporting the theory that oxidation is a
cause of cone death.

Figure 1 | Glucose uptake in cone cells. a, Intermingled rod and cone photoreceptor cells in the outer
portion of the retina receive glucose from blood vessels, through retinal pigmented epithelial (RPE)
cells. Glucose uptake in photoreceptors is mediated by the transporter protein Glut-1, which interacts
with another protein, Basigin-1. Ait-Ali et al. demonstrate that a protein secreted from rods, called
rod-derived cone viability factor (RdCVF), associates with Basigin-1 to improve glucose uptake and
the production of energy from glucose by inducing the formation of a more active form of Glut-1. b, In
diseases such as retinitis pigmentosa, in which rods die, cones seem to be unable to take up enough
glucose to fuel their metabolism, and so also die. c, The authors show that this defect can be prevented by
addition of RdCVF.

Healthy photoreceptors are metabolically
very active, and so require high levels of glucose,
which is delivered from the bloodstream
through retinal pigmented epithelial cells.
Because NADPH is produced by the oxidation
of glucose, the demand for glucose in
hyperoxic conditions is likely to be exceptionally
high. A glucose transporter protein called
Glut-1, located on the cell surface, mediates
glucose uptake by photoreceptors, and evidence
suggests that a failure to take up sufficient
glucose might contribute to cone death
in retinitis pigmentosa. But there is a puzzling
aspect to this model: glucose is delivered to
the retina at a high rate and, after the death of
rods, cones should have access to higher than
normal levels of glucose. This suggests that
there must be an added level of complexity
underlying glucose uptake in cones.

Another factor that supports cone-cell
survival is a protein secreted from rods, called
rod-derived cone viability factor (RdCVF),
that may have antioxidant activity. The cones
of mice lacking RdCVF are more susceptible to
oxidative damage than those of controls, and
these mice show reduced photoreceptor activity
with ageing. Ait-Ali et al. therefore set out
to explore the mechanism by which RdCVF
promotes cone-cell survival. Using mass
spectrometry, they identified a protein, called
Basigin-1, that binds to RdCVF. Basigin-1 is
found on the surface of cones and is known
to cause retinitis pigmentosa when mutated
in mice. The authors also identified Glut-1 as
a Basigin-1-binding protein. But a previous
study showed that, contrary to what might
have been expected, loss of Basigin-1 did not
affect the expression of Glut-1 or, for the most
part, its distribution in the retina.

Ait-Ali and colleagues observed that
addition of RdCVF increased glucose uptake,
lactate release and ATP production in
photoreceptor cells in vitro, three cellular
responses suggesting that RdCVF increases
metabolic flux. Furthermore, the authors
found that a decrease in levels of Basigin-1
and Glut-1 eliminated the ability of RdCVF to
promote photoreceptor survival. The authors
propose that RdCVF, Basigin-1 and Glut-1
form a complex at the cell surface that
increases glucose uptake (Fig. 1). However,
the level of Glut-1 on the cell surface did
not increase after RdCVF addition, leading
the researchers to suggest that this complex
instead acts to increase levels of the active
form of Glut-1. Future work should test this
model. It will also be of interest to study the
potential antioxidant role of RdCVF, together
with the formation and activity of this
three-protein complex, which might, as Ait-Ali
et al. suggest, depend on a redox-sensitive
interaction between RdCVF and Basigin-1.

A study published earlier this year showed
that the delivery of RdCVF in mice with retinitis
pigmentosa using an adeno-associated virus
(AAV) prolonged cone survival and function.
AAV is a safe and effective vector that is used
for ocular gene therapy in humans, and this,
together with Ait-Ali and colleagues' finding
that Basigin-1 is expressed in human retinas,
suggests that AAV-RdCVF might be an effective
way to treat many types of photoreceptor
disease. Owing to the large number of disease
genes that cause blindness in humans, a treatment
that could promote the survival of cones
in a gene-independent manner would be a
welcome prospect.
ADDENDUM

The News & Views article ‘Alzheimer’s
disease: From big data to mechanism’ by
Vivek Swarup and Daniel H. Geschwind
(Nature 500, 34-35; 2013) commented on
the paper ‘Integrative genomics identifies
APOE ε4 effectors in Alzheimer’s disease’
by H. Rhinn et al. (Nature 500, 45-50;
2013). This paper has now been retracted.
For further information, see
http://dx.doi.org/10.1038/nature14591
The changing form of Antarctic biodiversity


Steven L. Chown, Andrew Clarke, Ceridwen I. Fraser, S. Craig Cary, Katherine L. Moon & Melodie A. McGeoch


Antarctic biodiversity is much more extensive, ecologically diverse and biogeographically structured than previously 
thought. Understanding of how this diversity is distributed in marine and terrestrial systems, the mechanisms 
underlying its spatial variation, and the significance of the microbiota is growing rapidly. Broadly recognizable 
drivers of diversity variation include energy availability and historical refugia. The impacts of local human activities 
and global environmental change nonetheless pose challenges to the current and future understanding of Antarctic 
biodiversity. Life in the Antarctic and the Southern Ocean is surprisingly rich, and as much at risk from environmental
change as it is elsewhere.


The past two decades have witnessed a revolution in the
approaches used to document patterns in and the functional
significance of the diversity of life. Knowledge of the global
distributions of organisms now extends to many groups. These include
terrestrial plants and animals, marine taxa, components of the soil
biota, and, increasingly, microorganisms. Empirical and theoretical
research is verifying the range of mechanisms, including energy and
nutrient availability, and historical contingency, that are responsible
for these patterns, and the role of diversity in maintaining ecosystem
services. New approaches are also revealing rapid changes in diversity
across most ecosystems, their often negative implications for the future
of the planet, and our struggle to address the problem.

In tandem, biodiversity research has increased across the Antarctic
and Southern Ocean. The growing interest has been driven in part by the
realization that any fundamental quest to understand life’s diversity
requires exploration of the polar regions. But it has also been underpinned
by an increasing appreciation of the particular challenges faced
by Antarctic species and ecosystems. Climate change, and economic
activity in the form of fishing, tourism, and scientific research itself,
are all having impacts on Antarctic diversity. The intricacy of Southern
Ocean food webs, best known for the region’s iconic seals, penguins,
whales and albatrosses, has been further unveiled, highlighting the
complexity of the simultaneous impacts of changing climates, sea ice
distribution and fisheries on this system. New marine research has
also exposed unexpectedly high benthic diversity, including in the deep
sea. In terrestrial areas, the diversity of higher plants and animals is
relatively low (for example, just two flowering plant species), but the
diversity of lichens, bryophytes, invertebrates, and the microbiota is
substantial. Local and regional spatial patterning in these groups is
extensive, with the mechanisms underlying these patterns both recognizable
from broader ecological theory and unusual in several respects.

Here we explore recent advances in our understanding of the biodiversity
of the Antarctic. Given the many conservation challenges it faces,
we also consider how well the region is performing in relation to an
aspirational set of global conservation benchmarks, the Aichi Targets.
Biodiversity conventionally encompasses variability across the entire
biological hierarchy, from genes to ecosystems and their functioning.
In the Antarctic, considerable recent progress has been made on all of
these fronts. In this review we focus primarily on ecological diversity,
its drivers, and changes that are being wrought by both local anthropogenic
and global environmental change. We first bring together
recent studies that have advanced our understanding of the diversity
of marine and terrestrial groups, including the microbiota, and their
evolution in the region. We then show how ecological and historical
processes shape diversity and how they might change in the future.
Finally, we assess Antarctic conservation in a global context, and highlight
the biodiversity research challenges facing the region. Doing so is
especially timely given the escalating political prominence of the
Antarctic, due to debates about use of its resources, and the growing
extent of science, fishing and tourism and their likely impacts.


Marine richness, species flocks and diversification 


The traditional view of Antarctic marine metazoan diversity has been of
a species-poor fauna constrained by a harsh environment. Some groups
are genuinely depauperate in the Southern Ocean, notably teleost fish,
gastropods, bivalves and some crustaceans. Initial work on these groups
largely shaped early views of the fauna as a whole. By the turn of the
century, however, benthic diversity was thought to be high, with much
of the fauna remaining undiscovered or undescribed. Substantial
sampling and systematic campaigns associated with the Census of
Marine Life have since supported this idea. More than 8,000 valid
marine species are now known for the Southern Ocean, most of which
are benthic. Many of the new taxa have come from sampling of the
deep Southern Ocean. For example, campaigns in the deep Weddell Sea
recovered 674 isopod species, of which more than 80% were new to
science. The Southern Ocean is clearly not the universally depauperate
region it was originally thought to be. Benthic sampling has also
improved understanding of the evolution of unusual groups such as
the acorn worms. Individuals collected along the West Antarctic
Peninsula and in the Ross Sea live in secreted translucent tubes, making
them similar to acorn worms of the Middle Cambrian, demonstrating
conservation of this behaviour for 500 million years.

Recent work has consolidated the view that strong latitudinal marine
diversity gradients, with declines in species richness to the poles, are
more general for the northern than southern hemisphere. Indeed, for
groups such as hexacorals (excluding scleractinians), pycnogonids and
bryozoans, current knowledge suggests that diversity in the Southern
Ocean is comparable with many temperate and non-reef tropical
habitats. In others, such as the petrels, albatrosses and their relatives,
diversity patterns are atypical, with diversity peaks not in the tropics, nor
at the highest southern latitudes, but rather in the productive areas
between 35° S and 50° S. The distinctiveness of the Southern Ocean
fauna is also much higher than previously thought. Recent estimates
suggest that 50 to 97% of Southern Ocean species in various groups such
as sponges, tube worms, amphipods, molluscs, isopods, sea spiders and
notothenioid fish are endemic. The deep-sea fauna is also quite different
to that of other regions. For example, the hydrothermal vent faunas
of the East Scotia Sea lack the polychaete worms, alvinocaridid shrimp
and bathymodiolid mussels typical of other vent systems; instead these
vents are characterized by peltospiroid gastropods, stalked barnacles, sea
stars, and a previously unknown anomuran yeti crab species. These
differences suggest that the insights into metabolic functioning and the
evolution of symbioses that have come from the exploration of deep-sea
hydrothermal vents elsewhere may be considerably extended through
examination of Antarctic vent faunas.

Investigation of the high marine endemicity has revealed how a complex
set of earth system processes, including glaciation, differential
diversification, and isolation, has interacted to shape the evolution of
the southern biota. In turn, the outcomes of this evolutionary interaction,
such as the uniquely high prevalence in the region of marine
invertebrate species lacking a planktonic larval phase, have profound
consequences for responses to ongoing global environmental change,
and for forecasting them, owing to variation in dispersal capability.

One factor contributing to the rapid rise in estimates of diversity of
the Southern Ocean has been the identification, typically using molecular
approaches, of many cryptic species. For example, within each of
four Orchomenella amphipod species, several well-supported clades
have been found that probably represent reproductively isolated
species. Evidence of cryptic speciation is also present in Antarctic
isopods, echinoderms, and sea slugs. In the latter, an original single
species, Doris kerguelenensis, is now thought to comprise a species flock,
with diversification driven by predation. Several other invertebrate
groups, including echinoids, isopods and amphipods, may also qualify
as marine species flocks. Notothenioid fishes have long been recognized
as a species flock, particularly those with antifreeze glycoproteins,
and the broad group includes several sub-clade radiations that
have been linked to periods of climate change (Fig. 1). Exploration of
the way in which these species flocks have developed has provided much
insight into evolutionary processes associated with rapid diversification,
and those associated with survival in Antarctic waters.

The cyclical waxing and waning of the ice sheets has been a major
driver of range fragmentation and allopatric speciation in several marine
benthic groups. Although much of the shelf and slope will have been
inimical to life at glacial maxima, and some shelf taxa undoubtedly
retreated to the deep sea, palaeobiological and biogeographic evidence,
coupled with recent molecular research, indicates that many species
have persisted on the shelf throughout numerous glacial cycles.
These taxa could have survived glacial periods in situ in regions
where the ice sheet did not extend to the edge of the continental
shelf, particularly where polynyas were formed by katabatic winds at
shelf margins. Changes in sea level, together with partial collapses of the
West Antarctic Ice Sheet between 1.1 million years and 125,000 years BP,
may have allowed faunal exchange through seaways across the Antarctic
continent, now closed by ice (Fig. 1). These events have resulted in
substantial biogeographic structuring. The Southern Ocean is therefore
not a relatively uniform region, but is biogeographically complex,
with considerably more local structuring than previously thought. Few
taxa appear to be truly circumpolar, and in species initially believed to
have broad (or even circumpolar) distributions, subsequent phylogeographic
work is revealing multiple species or divergent lineages with more
restricted ranges.

Figure 1 | The Antarctic region is neither as isolated nor as depauperate
in biodiversity as once thought. Oceanic fronts can be permeable, with mesoscale
eddies transporting pockets of water and associated organisms across
them (inset figure lower left modified from ref. 109, with permission from
Elsevier). Dispersal of marine species around Antarctica has been facilitated by
the Antarctic Circumpolar Current (ACC) and Antarctic Coastal Current
(ACoC), as well as by trans-Antarctic seaways during some interglacials.
Glacial cycles have driven diversification, with some species surviving within
the Antarctic region during glacial periods (for example, in terrestrial
refugia such as geothermal areas or marine shelf refugia) and undergoing
allopatric divergence. Species flock radiations have been inferred for some
groups, such as Antarctic notothenioid fish, which have diversified during
periods of climate change (tree derived from ref. 32). The mean positions of the
major oceanic fronts are shown: Southern Antarctic Circumpolar Current
Front (SACCF), Antarctic Polar Front (APF), Subantarctic Front (SAF) and the
Subtropical Front (STF). Locations of possible terrestrial geothermal refugia
(volcanoes and heated rocks) are indicated by yellow dots. Antarctic bedrock
(>0 m) is roughly indicated in pale brown (derived from ref. 110), and the
Antarctic continental shelf in blue (not shown for other continents). The
diversity and position of the 15 distinct Antarctic Conservation Biogeographic
regions are indicated in the central panel on the right.

432 | NATURE | VOL 522 | 25 JUNE 2015


Diversity in the terrestrial biota 


Although flowering plant diversity is low in the Antarctic, other groups,
such as the lichens (>200 species) and mosses (>100 species), are
comparatively species-rich. Some terrestrial invertebrate groups are
also well represented, including the tardigrades, nematodes, springtails
and mites. The distribution patterns of these organisms are highly
variable. Recent phylogeographic work shows that while some species
appear truly circum-Antarctic in their distribution (for example, rotifers
and nematodes), others, such as springtails, are much more differentiated.
By coupling the growing availability of continent-wide biodiversity
databases with ecoinformatic approaches, recent analyses have
shown substantial spatial patterning in diversity across the continent,
with fifteen distinct ecoregions now recognized on the continent itself,
and another eight across the Southern Ocean islands. Much diversity
within species, at local spatial scales of hundreds of metres to hundreds
of kilometres, is also being discovered through phylogeographic
approaches, reflecting the effects of both older glacial history and more
recent events.

As with marine groups, substantial cryptic diversity has been unveiled
in the terrestrial metazoan biota, and its diversity is probably much
higher than is currently appreciated. Moreover, for most groups, including
those that are considered well known, the full extent and structure of
the continent’s diversity remains partially understood because much of
it has not been thoroughly surveyed. Although the overall area of ice-free
land is small, about 46,000 km² (about 1/3 the area of England) or
less than 0.5% of the continent’s surface area, a great deal of it is remote
and difficult to access.

The bulk of Antarctica’s terrestrial diversity lies in the less visible
groups, represented by the microbiota. Metagenomics has substantially
improved current understanding of both its structure and
functioning. Early, culture-dependent studies provided only a
superficial glimpse of the resident terrestrial microbial diversity.
Independent, comparative studies from around the continent, using
next-generation sequencing technologies and metagenomic approaches,
concur that terrestrial microbial community phylogenetic and taxonomic
diversity are typically higher than once thought. Nonetheless,
this diversity is lower than in many temperate soil systems, probably
the result of selection imposed by the generally cold, dry, oligotrophic
nature of the Antarctic soil system, though in some areas soils are
moist and enriched by the activities of seabirds and seals.

At the phylum level, Antarctic soils appear to be structured similarly
to soils elsewhere, dominated by the Acidobacteria, Proteobacteria,
Actinobacteria and Bacteroidetes. However, when examining individual
‘species’ composition, the picture is very different, suggesting selection
for a highly adapted and specialized community. Metagenomic
approaches have revealed much higher community-level genetic plasticity
than originally suggested for these extreme oligotrophic mineral
soils, especially in autotrophic, heterotrophic, and diazotrophic lifestyles,
along with significant representation of thermal, osmotic, dormancy
and nutrient limitation stress-response pathways. These
metagenomic surveys are beginning to provide clues as to what might
drive community assembly and interactions and ultimately their success.
They have also identified the presence of antibiotic resistance genes
and a prominence of phage signals, suggesting that some level of cell
competition and interaction may exist that could also play a critical
role in structuring these communities where the role of grazers
remains uncertain.

Terrestrial Antarctica also harbours a remarkable diversity of aquatic
systems ranging from thousands of small ponds and lakes scattered
across the terrestrial and ice shelf landscape to the recently accessed,
extensive network of subglacial lakes and streams. The lakes and ponds
range from permanently ice covered to perennially ice free, freshwater to
hypersaline, and highly stratified to permanently mixed, with much
variation in geochemistry. These features are all microbially dominated
and, like Antarctic soils, generally support relatively low diversity communities
when compared to similar temperate systems. Metagenomic
surveys suggest that this might not be the case for viruses, with Antarctic
lakes supporting the highest diversities of viruses in any aquatic system
yet examined, emphasizing their functional and evolutionary significance.
Microbial mats, especially important systems in polar aquatic
environments, also harbour high diversity, with members that include
cosmopolitan and apparently endemic elements, both in mats on land
and those on ice shelves. Perhaps most noteworthy from an evolutionary
perspective, the retrieval of samples from subglacial Lake Whillans
has validated the assertion that life exists below the Antarctic ice sheets.
The lake supports a diverse, metabolically active, chemosynthetic community
of Bacteria and Archaea, possibly relying on the oxidation of
ammonia originating from microbial mineralization in the sediments
laid down some 120,000 years ago.


Drivers of diversity 


Globally, the drivers of diversity have become increasingly well characterized,
although controversy about their relative significance continues.
Historical events such as glaciations and continental drift, along with
dispersal, have played major roles. Energy availability, in the form of
primary productivity, is likewise important for determining variation in
species richness. Other factors influencing diversity include temperature,
surface area, spatial and temporal heterogeneity in conditions,
and interactions among organisms such as through predation and disease.
Spatial scale and location play major roles in determining which of
these factors are most significant. For Antarctic marine and terrestrial
systems the same sets of drivers are often important.

In marine systems, richness varies with available energy, disturbance,
and local heterogeneity, while climate cycles and long-term oceanic
conditions have influenced which groups are diverse and which are less
well represented. Nonetheless, some of the drivers of diversity in the
Antarctic are unusual, providing unique insights into the ongoing controversy.
For example, spatial variation in the global richness of albatrosses,
petrels and their relatives, which peak in diversity in the region,
is positively related to the availability of large, contiguous areas of ocean,
temperature and wind speed, all of which nonetheless represent forms of
available energy. Wind energy is especially interesting because it is
rarely considered during investigations of the drivers of species richness,
but can clearly be important, as studies of individual species of seabirds
have demonstrated. Moreover, winds are strengthening substantially
in the region as a consequence of a southward shift and increase in
intensity of the Westerlies, partly as a result of anthropogenic ozone
depletion. For species which rely on wind energy these changes should
prove beneficial. Recent analyses of long-term life history and population
data on wandering albatrosses support such an effect. For breeding
adults on the Crozet Islands (located at 46° S in the Southern Indian
Ocean), rates of travel and flight speeds have increased and the duration
of foraging trips has decreased over the past 20 years. Breeding success
has consequently improved. Breeding birds have also gained about 1 kg
in mass (individual birds weigh about 8-11 kg), which may represent an
adaptive response to windier conditions.

In terrestrial Antarctic systems, as is the case for those elsewhere, the
availability of energy and water are the two most important drivers of
metazoan diversity. Across broader scales, moss, lichen and liverwort
species richness is positively related to temperature, though south of
72° S a suite of local microclimate factors including temperature, available
water, light and shelter may be more important than energy alone.
These gradients emphasize the importance of temperature in dictating
the availability of water, though at local scales individual species distributions
are typically also influenced by soil parameters, such as carbon
and nitrogen availability. Where geothermal energy warms the
ground, the availability of resources is higher both spatially and seasonally.
Such consistent energy availability has played a major historical role
in determining patterns of diversity across the continent. Analyses of a
large database on the distribution of diversity in the region have revealed
that for plants and fungi (mostly lichens) the presence of geothermal
sites, such as volcanoes, explains much of the variation in diversity. This
pattern extends beyond the immediate vicinity of these areas, suggesting
that such sites have served as refugia for long periods, including
during the Pleistocene glaciations. Refugia such as these, as well as
inland nunataks and coastal areas, have probably all had major roles
in determining the distribution of many of the continent's terrestrial
organisms. The extent to which the importance of these drivers varies
among different groups remains unknown, however, because of limited
biodiversity surveys. Nonetheless, the importance of energy availability
and refugia in terrestrial systems means that the drivers of diversity for
metazoans here have many parallels with those of the Antarctic marine
environment.

Although most macrofauna and even some micro-eukaryotes show
strong continental regionalization, much local structural heterogeneity
in soil microbial communities has been revealed by regional and landscape-scale
studies. Prokaryotic diversity variation, especially south of
70° S, may thus be driven predominantly by local microclimate and
landscape history. Soil surface ages and frequency of water availability
establish severe gradients in pH, soil moisture and salinity that
appear to be primary drivers for microbial community selection.
Certain dominant components of the edaphic community, such as
cyanobacteria, may be structured both by niche-related factors, such
as in the case of communities living below and within rocks, and in
other circumstances by dispersal-related spatial filters (that is, wind
processes) at a local scale. Recent studies examining the presence, composition,
and local contributions of a possible continent-wide aeolian
microbial community suggest that this community is primarily of
Antarctic origin with some contribution of more resilient taxa from
off-continent sources. The Antarctic aerosolized community appears
to have little impact on the less optimal cold, dry, edaphic mineral soil
communities, but makes a substantial contribution to the considerably
more selective geothermal soils.

For most lakes and ponds, geochemistry appears the primary driver
for microbial community composition and structure, in keeping with
the situation for many water bodies globally. Although mixing between
nearby and disparate ponds and lakes probably occurs, those with
similar geochemistry (that is, conductivity and/or nutrients) support
similar communities, at least at the higher taxonomic level, regardless
of location. This appears especially true for the more isolated bottom
water of permanently stratified lakes, as evidenced by the high level of
sequence divergence seen in certain groups. Even some of the micro-eukaryotes
appear to have very specific geochemically driven requirements,
while others are considerably more cosmopolitan.


Diminishing isolation 

An enduring concept is that the Antarctic biota has been isolated since 
the breakup of Gondwana. Geologically, the terrestrial link between 
Antarctica and South America was severed in the Eocene, and deep- 
water flow of the Antarctic circumpolar current (ACC) was initiated in 
the Late Oligocene. With the onset of the ACC came circumpolar
thermal fronts such as the subtropical front (STF) and Antarctic polar 
front (APF) (Fig. 1), which define the boundary zones between 
Antarctic, sub-Antarctic and temperate waters. Steep temperature gra- 
dients and limited water exchange across these zones created dispersal 
barriers, exacerbated by strong circumpolar winds and currents that 
promote eastward dispersal, but hinder north-south movement.
Although the APF has been a formidable challenge to faunal exchange,
it is not an absolute barrier. Phylogeographic research now supports
the idea of dispersal across the APF in a range of marine groups, such as
molluscs, echinoderms, and crustaceans. Many successful colonizations
appear to have preceded Pliocene intensification of the ACC,
but occasional dispersal events into and out of Antarctic waters have
occurred since that epoch. Similar dispersal events across the APF,
over the same timescales, have occurred in terrestrial bryophytes and
invertebrates. For important elements of the terrestrial microfauna,
such as the tardigrades, rotifers and nematodes, information remains
too sparse to reach general conclusions, although long-term isolation
in the continental Antarctic seems to be the case for some groups such as
the mites and springtails.

Conceptualized impacts of local (resource exploitation, invasive
species) and global (climate change, including increasing CO2
concentrations which result in ocean acidification) environmental
change drivers on Southern Ocean systems. Direct impacts are shown
by light blue arrows and indirect impacts by light red ones. The large
red and green arrows indicate declines or increases in populations,
respectively. Human impacts commenced with the exploitation of
whales and seals, which has now largely ceased. The main focus of
exploitation is now fish and krill, which not only has direct impacts
on their populations, but can also affect other species which either
depend on these organisms as a resource, or are caught incidentally
(such as some seabirds). Fishing outside the region also influences
seabird populations as a consequence of incidental mortality. Climate
change is having complex impacts through changes in sea ice
distribution and temperature. In the future, limited physiological
tolerances and their capacity for change may have considerable
effects both on vertebrates and invertebrates. Ocean acidification
is already affecting calcified organisms, and may influence krill via
impacts on eggs. Although records of marine invasive species are
uncommon, introductions, such as of crabs, may pose considerable
threats to local communities.

The rarity of natural dispersal events has, until recently, allowed 
Antarctica’s biota to evolve and diversify in relative isolation. Signals
of natural dispersal, colonization and diversification for Antarctica are 
now at risk of being overwhelmed by impacts associated with changing 
climates and rapidly increasing human movement both into the region 
and between its distinct ecoregions. For example, 20% of tourist and 
45% of scientific visitors to Antarctica carry plant propagules from 
species growing elsewhere, with more than 70,000 seeds estimated to 
be entering the region annually. Many of these are from regions such as
the Arctic and sub-Antarctic, where conditions are similar to more 
benign parts of the continent, making establishment in Antarctica
more likely. Some of these species have already established themselves
at several locations, such as annual bluegrass, Poa annua, which is a
superior competitor to the only two indigenous vascular plant species
found along the Antarctic Peninsula. Newly arrived terrestrial species
continue to be documented, and climate change is expected to increase
the likelihood of establishment and further impact by non-indigenous
species.

Some parts of the Antarctic marine system may be similarly affected,
especially shelf environments. Ship traffic to the Antarctic carries a
diverse assemblage of non-indigenous species as fouling, and such
traffic is heavy to the Antarctic Peninsula. Here, one widely-discussed
risk is the spread of crabs with the potential for substantial changes to
the benthic system (Box 1). What the risks are for invasion of other
areas of the Antarctic by marine groups is not clear, with understanding
for the broader region poorly developed compared with terrestrial systems,
stimulating controversy about the likely significance of marine
invasions.


Antarctica and the strategic plan for biodiversity
Knowledge of the negative consequences of global change for the
increasingly apparent scope and novelty of Antarctic biodiversity is
growing (Box 1). A useful perspective on how well conservation of the
region is performing by global standards comes from the Convention on
Biological Diversity’s (CBD) Aichi Targets of the Strategic Plan for
Biodiversity 2011-2020.


Table 1 | The relevance to the Antarctic of the 20 Aichi Targets under the Strategic Plan for Biodiversity 2011-2020 of the Conference of Parties to the Convention on Biological Diversity.

Aichi Target topic areas | Relevance of Target to the Antarctic
1* Awareness of biodiversity values. | While some Antarctic Treaty nations have coordinated programs promoting awareness of the value of the region, others do not. No coordinated regional program to do so exists.
2* Biodiversity values integrated into planning and reporting. | Biodiversity values are recognized in the Antarctic Treaty System through the CEP.
3* Incentives for conservation and sustainable use. | Most applicable to nations fishing in the region, though the extent of such incentives is not widely known.
4 Impacts of natural resource use maintained within safe ecological limits. | No agricultural production takes place in the Antarctic. Resource extraction (fisheries) supplies consumers outside of the region.
5 Reduction in the rate of habitat loss. | Changing climates are altering both marine and terrestrial systems.
6* Sustainable fisheries and aquatic ecosystems. | A substantial fishery exists, overseen using an ecosystem approach by the CCAMLR. Much concern exists about the sustainability of the fishery.
7 Sustainable production systems. | No area is currently under production.
8* Pollution reduction. | Point source pollution is limited with much effort to reduce it.
9* Prevention and management of biological invasions. | Non-indigenous species are a major concern for the region, and are a priority for the CEP. Activity to control species and pathways is inconsistent and uncoordinated.
10* Pressures on marine ecosystems impacted by climate change. | Ocean acidification is a substantial threat to Southern Ocean ecosystems.
11* Targets for protected areas. | Only 1.5% of terrestrial biodiversity area is formally protected. Management is inadequate. The designation of MPAs is controversial.
12* Improvement in species conservation status. | Knowledge of the conservation status of taxa is poor, other than for most birds and mammals.
13 Safeguarding genetic diversity of valuable species. | Largely not relevant, although bioprospecting is increasing. Southern Ocean ecosystems are important for whales, which have cultural value.
14* Restoration of ecosystem services. | Antarctic ecosystems provide several services essential to planetary stability, including maintenance of land ice to prevent sea level rise, and climate regulation.
15 Contribution to climate change mitigation. | Not relevant to terrestrial systems. Its likely relevance is through marine system contributions to CO2 sequestration.
16* Fair and equitable access and benefit sharing. | Bioprospecting takes place widely in the region, with no agreement on access and benefit sharing.
17* Up to date biodiversity strategy and action plans. | Most of the nations with activities in the region are party to the CBD and have national biodiversity strategy and action plans. The Antarctic region is seldom considered in them. No comparable regional plan exists.
18 Respect for traditional knowledge. | No indigenous or local communities in the region are recognized by the Antarctic Treaty System.
19* Improved knowledge of biodiversity. | Scientific activity is supported by many national programmes, communicated inter alia via the Scientific Committee on Antarctic Research.
20* Mobilization of financial resources. | No mechanisms exist for mobilizing financial resources in support of a strategic plan for biodiversity in the region.

Targets with an asterisk are most relevant to the region.


The Antarctic region is largely an international space governed pre- 
dominantly through the Antarctic Treaty (which applies to all areas 
south of 60° S), to which 52 countries are party. Environmental matters 
are the main concern of its Committee for Environmental Protection 
(CEP) and the Convention on the Conservation of Antarctic Marine 
Living Resources (CCAMLR). In consequence, the region is typically
not subject to many global agreements such as the CBD. Nonetheless, 
many of the Antarctic Treaty Parties are also signatories to these international
agreements, including the CBD, and are actively developing
plans to achieve the Aichi Targets.

The Aichi Targets encompass 20 aspirational goals for global 
biodiversity conservation. Not all of them are immediately relevant 
to the Antarctic. For example, Targets 7, 13 and 18 are not germane 
because the activities and species they cover, and indigenous human 
populations, are typically absent from the region. Many of the 
targets could, however, be applied to the Antarctic (Table 1). Of 
these, only performance against Target 11, concerning protected 
areas, has been explicitly assessed. Just 1.5% of the continent’s
ice-free area is formally designated as specifically protected, and 5 
of its 15 ecoregions have no protection. Antarctic terrestrial biodi- 
versity protection thus falls short of the 17% area coverage proposed 
in Target 11, and by some accounts management is insufficiently 
effective too. Similar assessments have not been made for marine
protected areas (MPAs), but only one exists in the Treaty Area. 
Together with recent controversy about the inability of the parties 
to CCAMLR to agree on Antarctic MPAs, this situation suggests 
that for marine systems, area protection is poorer than for the
terrestrial Antarctic. 
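
The size of the shortfall against Target 11 follows directly from the figures quoted above. As a minimal sketch, using only the numbers stated in the text (1.5% of ice-free area protected, a 17% target, and 5 of 15 ecoregions without protection), the arithmetic can be written out as follows. The variable names are illustrative assumptions, not terms used by the CBD or the Antarctic Treaty System.

```python
# Figures quoted in the text; variable names are illustrative only.
TARGET_11_TERRESTRIAL_PCT = 17.0   # area coverage proposed in Target 11
protected_ice_free_pct = 1.5       # ice-free area formally protected
ecoregions_total = 15
ecoregions_unprotected = 5

# Gap between the target and current protection, in percentage points.
shortfall_points = TARGET_11_TERRESTRIAL_PCT - protected_ice_free_pct

# Current protection expressed as a fraction of the target.
fraction_of_target = protected_ice_free_pct / TARGET_11_TERRESTRIAL_PCT

# Share of ecoregions with no formal protection at all.
unprotected_share = ecoregions_unprotected / ecoregions_total

print(f"Shortfall: {shortfall_points:.1f} percentage points")
print(f"Protection is {fraction_of_target:.1%} of the Target 11 level")
print(f"Ecoregions without protection: {unprotected_share:.0%}")
```

On these figures, terrestrial protection stands at under a tenth of the Target 11 level, and one-third of the ecoregions have no protected area.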

For the remainder of the relevant Aichi Targets, the available evid- 
ence suggests that were assessments to be made, the region might fare 
poorly in all except perhaps Targets 8 and 9 (Table 1). The reduction 
of pollution to levels that are not detrimental to ecosystem function 
and biodiversity is the subject of Target 8. Much progress has been 
made in addressing pollution, though in some areas substantial 
problems still exist. Target 9 specifies that by 2020, invasive alien
species and pathways of introduction should be identified and prior- 
itized, priority species controlled or eradicated, and measures put in 
place to prevent the introduction and establishment of such species. 
The Antarctic Treaty Parties have taken meaningful, evidence-based 
steps to identify pathways of invasion and priority species, and to 
prevent introduction and establishment, based on the advice of 
the CEP.


Outlook and challenges 


The importance of science in and about the Antarctic region has recently 
been re-emphasized by the identification of 80 key questions that are on 
the scientific horizon for the region. Many of these questions illustrate
the major role science will play in mitigating the consequences of envir- 
onmental impacts in the Antarctic and in providing the information 
required for evidence-based policy to do so both locally and globally. 
Perhaps more importantly from a fundamental science perspective, they 
also recognize that human activities in the Antarctic and elsewhere may 
be compromising current approaches and future ability to understand 
biodiversity and ecosystem functioning in the region. 

From this perspective and from what is now known about biodiver- 
sity in the region, several challenges remain. First among these is inad- 
equate knowledge of terrestrial diversity in many areas, for many taxa, 
and the absence of any coordinated program to rectify the situation. In 
the face of substantial activity across the region, and the clear indication 
of regional diversification in many groups, signals of phylogenetic and 
functional distinctiveness, especially in the microbiota, may be substan- 
tially compromised. Risks of introduction of non-indigenous taxa are 
not only from outside the continent, but also from regions that are 
significantly differentiated on the continent, at both continent-wide 
and local scales. Comprehensive understanding of how diversity patterns
and the mechanisms underlying them vary among major groups,
and among major areas (such as above and below about 70° S), is yet to 
be developed, restricting comparisons with elsewhere and with marine 
systems. Biotic homogenization as a consequence of both scientific and 
non-scientific activities would certainly obscure understanding, though 
to what extent this might happen is itself unknown. 

The importance of limiting local impacts from human activities has 
been further emphasized by recent work showing that responses to 
climate change, and in particular to pulse events, may be rapid.
Increases in water availability and carbon as a result of warmer, wetter 
summers, and progressively warming regional climates as seen in the 
Peninsula, could radically alter biodiversity in the continent’s ice-free
areas on a near-immediate time scale, such as by increasing growth 
rates and activity periods, so increasing abundance and richness. 
Understanding the likelihood of such change and documenting it pre- 
supposes a limited influence of local disturbance. 

A similar situation is true of marine systems (Box 1), where 
understanding of the functioning and dynamics of large, relatively 
undisturbed ecosystems remains an important goal of ecology. Until 
recently, this goal may well have been possible for the Antarctic given 
low impacts on the Ross Sea system, acknowledging the roles played by 
whales and seals that were removed in large numbers in the 19th and 
20th centuries. Now, that possibility is under threat. Changes in stratospheric
ozone and greenhouse gases are driving substantial alterations to
sea ice concentrations and seasonality, and ocean pH in the system,
and little progress has been made in securing the area from the local, and
considerable, impacts of fishing. Establishing a substantial MPA for
the Ross Sea is not only important for conservation, but also for the
scientific value of retaining a large marine system that is not being 
simultaneously affected by both global anthropogenic drivers and local 
ones. Few such other marine systems exist. More generally, the situation 
reflects the absence of substantial national collaboration to achieve 
region-wide conservation objectives. No commitment to anything equi- 
valent to the Aichi Targets exists for the region, and the Antarctic Treaty 
System remains underfunded, with new activities typically not reaching 
consensus because of budgetary concerns. In the face of growing chal- 
lenges presented by fishing and interest in the Antarctic’s other 
resources, the situation is especially concerning.

Rapid developments in a range of areas nonetheless suggest that, from 
a science perspective, many of the challenges can be overcome. These 
include new DNA archival approaches that enable samples collected 
from rare visits to remote sites to be stored and re-investigated repeatedly,
satellite remote sensing that enables total population counts
and identification of areas with much metazoan diversity, and new
analytical tools to enable comparisons among multiple groups and
regions. Approaches that limit the influence of scientific activities
on the very systems being investigated are also developing rapidly.
The application of these developments will mean a larger transforma- 
tion in understanding of Antarctic biodiversity over the next 20 years 
than its change over the last century. In so doing, investigations of a
globally unusual and important system will continue to contribute to 
general understanding of the planet’s diversity and ecosystems, and 
at the same time provide further evidence to support its stringent 
conservation. 
Competitive binding of antagonistic
peptides fine-tunes stomatal patterning 


Jin Suk Lee, Marketa Hnilova, Michal Maes, Ya-Chen Lisa Lin, Aarthi Putarjunan, Soon-Ki Han,
Julian Avila & Keiko U. Torii


During development, cells interpret complex and often conflicting signals to make optimal decisions. Plant stomata, the 
cellular interface between a plant and the atmosphere, develop according to positional cues, which include a family of 
secreted peptides called epidermal patterning factors (EPFs). How these signalling peptides orchestrate pattern 
formation at a molecular level remains unclear. Here we report in Arabidopsis that Stomagen (also called EPF-LIKE9) 
peptide, which promotes stomatal development, requires ERECTA (ER)-family receptor kinases and interferes with the 
inhibition of stomatal development by the EPIDERMAL PATTERNING FACTOR 2 (EPF2)-ER module. Both EPF2 and 
Stomagen directly bind to ER and its co-receptor TOO MANY MOUTHS. Stomagen peptide competitively replaced EPF2 
binding to ER. Furthermore, application of EPF2, but not Stomagen, elicited rapid phosphorylation of downstream 
signalling components in vivo. Our findings demonstrate how a plant receptor agonist and antagonist define 
inhibitory and inductive cues to fine-tune tissue patterning on the plant epidermis. 


Development and pattern formation of multicellular organisms rely on 
diffusible signals that instruct cells to adopt a specific fate for optimal 
function, and hence organismal fitness. Often such signals are encoded 
by multiple gene families, which raises the question of how a given cell 
orchestrates the decision-making process. For instance, a family of 
secreted signals, such as FGFs, are used in an iterative manner to 
specify multiple, diverse developmental processes in animals. While
peptide signalling has recently emerged as a critical regulator of plant 
development, how specific members of plant peptide families share
and distribute functions remains unclear. Patterning of stomata— 
valves on the plant epidermis that mediate carbon dioxide acquisition 
and water control—relies on cell-cell communication, which specifies 
a subset of seemingly uniform protodermal cells to acquire stomatal 
progenitor fate. Two secreted cysteine-rich peptides, EPF1 and EPF2, 
are expressed in later and earlier stages of stomatal precursors, respectively,
and are perceived by the cell-surface receptors, ER-family
leucine-rich repeat receptor kinases (LRR-RKs; ER, ER-LIKE1
(ERL1) and ERL2), to inhibit stomatal development. The receptor-like
protein TOO MANY MOUTHS (TMM) modulates the signalling
strengths of ER-family receptor kinases in a region-specific
manner. Genetic evidence suggests that the signals are mediated
via a mitogen-activated protein kinase (MAPK) cascade, which eventually
downregulates the transcription factor responsible for initiating
stomatal lineage via direct phosphorylation.

Recently, EPF-LIKE9 (EPFL9) peptide, also known as Stomagen, was 
identified as a positive regulator of stomatal development, a role oppos- 
ite to EPF1 and EPF2 (refs 13-17). Structural modelling of the EPF/ 
EPFL-family peptides using the NMR-solved structure of Stomagen 
predicts that they all adopt related structures. This raises the question
of how structurally related peptides can confer completely opposite
developmental responses. The molecular mechanism for Stomagen 
action remains unknown. 


Stomagen acts downstream of the ER family 


To place Stomagen into a genetic framework of the core stomatal signalling
pathway, we first examined the effects on er erl1 erl2 triple mutant
phenotypes of STOMAGEN overexpression induced by an oestradiol-induction
system (iSTOMAGEN) or of STOMAGEN co-suppression by artificial
microRNA (STOMAGEN-ami) (Fig. 1 and Extended Data Figs 1-4).
As previously reported, ectopic iSTOMAGEN expression resulted in
an increase in stomatal density (number of stomata per mm²), stomatal
index (percentage of stomata per total number of stomatal and non-stomatal
epidermal cells) and severe stomatal clustering in wild-type
cotyledon epidermis (Fig. 1a, b, k and Extended Data Figs 1-3). In
contrast, iSTOMAGEN had no effect on stomatal density, stomatal
index, or stomatal clusters in er erl1 erl2 cotyledons, just like in tmm
(Fig. 1 and Extended Data Fig. 3), suggesting that STOMAGEN and
the ER family act in the same pathway.
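
The two measures defined above, stomatal density and stomatal index, can be computed directly from cell counts. The following is a minimal sketch of those definitions only. The function names and the example counts are hypothetical and are not taken from the study's data or analysis code.

```python
def stomatal_density(n_stomata: int, area_mm2: float) -> float:
    """Number of stomata per square millimetre of epidermis."""
    if area_mm2 <= 0:
        raise ValueError("area_mm2 must be positive")
    return n_stomata / area_mm2


def stomatal_index(n_stomata: int, n_non_stomatal: int) -> float:
    """Percentage of stomata among all epidermal cells counted
    (stomatal plus non-stomatal)."""
    total = n_stomata + n_non_stomatal
    if total == 0:
        raise ValueError("no cells counted")
    return 100.0 * n_stomata / total


# Hypothetical counts from a single field of view (illustrative only).
density = stomatal_density(n_stomata=30, area_mm2=0.25)
index = stomatal_index(n_stomata=30, n_non_stomatal=90)
print(f"density = {density} stomata per mm^2, index = {index}%")
```

Because the index is normalized to the total number of epidermal cells, it separates a change in the proportion of cells adopting stomatal fate from a change in cell size or number, which density alone does not.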

As reported, STOMAGEN-ami lines markedly reduced stomatal
development in wild-type cotyledons (Fig. 1a, c, j and Extended
Data Fig. 4). In contrast, STOMAGEN-ami had no effect on stomatal
density, stomatal index, and stomatal clustering phenotype of er
erl1 erl2 cotyledons, just like tmm (Fig. 1 and Extended Data Fig. 4).
Thus, ER-family receptor kinases are required for the hypermorphic
and hypomorphic effects of Stomagen. The epistasis of er erl1 erl2
stomatal cluster phenotype over the phenotype of STOMAGEN-ami
places the ER family downstream of STOMAGEN, consistent with the
molecular identity of their gene products as receptor kinases and a
secreted peptide.


Genetic dissection of Stomagen action 

To dissect the role of Stomagen on the TMM and ER module, we
comprehensively investigated the effects of iSTOMAGEN on stomatal
differentiation in tmm hypocotyls with additional er-family mutations
(Fig. 2 and Extended Data Fig. 5). In hypocotyls, TMM and
ER family have opposite functions: tmm hypocotyls lack stomata,
whereas er erl1 erl2 hypocotyls produce stomatal clusters. While
tmm is epistatic to er single mutation in hypocotyls, consecutive
loss of ER-family genes reverts stomatal development in a dosage-dependent
manner, with er erl1 erl2 being epistatic to tmm (ref. 7).
iSTOMAGEN does not confer stomatal differentiation in tmm
hypocotyls. However, in some instances arrested stomatal precursor
cells (stomatal-lineage ground cells (SLGCs)) were observed, indicating
that, in the absence of TMM, iSTOMAGEN could initiate
stomatal development in hypocotyls (Fig. 2a, b and Extended Data
Fig. 5c, d). Additional er-family mutations exaggerated this effect; that
is, iSTOMAGEN in tmm er and tmm erl2 hypocotyls, both of which
lack stomata, resulted in SLGC clusters (Fig. 2c, d and Extended Data
Fig. 5e-h). iSTOMAGEN triggered stomatal cluster formation in tmm
erl1, tmm erl1 erl2 and tmm er erl1 mutant hypocotyls, while intensifying
stomatal entry divisions in tmm er erl2 hypocotyls (Fig. 2e, f and
Extended Data Fig. 5i-p). Different effects of iSTOMAGEN on the
higher-order mutants lacking ER (for example, tmm er and tmm er
erl2) compared with those lacking ERL1 (for example, tmm erl1 and
tmm erl1 erl2) reflect the overlapping yet unique roles of ER and ERL1
in stomatal development. Finally, iSTOMAGEN failed to enhance the
severe stomatal clustering phenotype in tmm er erl1 erl2 mutants
(Fig. 2g, h and Extended Data Fig. 5q, r). Quantitative analysis of
stomatal index and SLGC index (the percentage of SLGCs in total
epidermal cells) supports these findings (Extended Data Fig. 5s, t).
Together, the results suggest that in the hypocotyls, where TMM and
ER-family act antagonistically, Stomagen primarily acts via all three
ER-family receptor kinases.

Figure 1 | Complete loss of ER-family genes confers insensitivity to
STOMAGEN overexpression and co-suppression. a-i, Representative
confocal images of cotyledon abaxial epidermis from 10-day-old light-grown
seedlings of wild type (a-c), tmm (d-f) and er erl1 erl2 (g-i), with induced
STOMAGEN overexpression (iSTOMAGEN; b, e, h) or STOMAGEN-ami
construct (c, f, i). Uninduced controls show no effects (see Extended Data
Figs 1-3). Images were taken under the same magnification. Scale bar, 30 μm.
n = 13 (a); n = 18 (b); n = 26 (c); n = 16 (d); n = 24 (e); n = 26 (f); n = 16
(g); n = 24 (h); n = 12 (i). j, Stomatal index. −, control; ami, STOMAGEN-ami.
Box, mean; bars, s.e.m. ***P < 0.005 (Wilcoxon rank sum test). NS, not
significant (P = 0.653 for tmm; P = 0.539 for er erl1 erl2). n = 8 for each
genotype. k, Stomatal index. −, uninduced; iSTOM, induced. Box, mean; bars,
s.e.m. ***P < 0.005 (Wilcoxon rank sum test). NS, not significant (P = 0.114
for tmm; P = 0.688 for er erl1 erl2). No induction, n = 16; iSTOM, n = 14;
tmm no induction, tmm iSTOM, er erl1 erl2, er erl1 erl2 iSTOM, n = 15 for
each genotype. For the total numbers of stomata counted, see legends for
Extended Data Figs 3 and 4.

Figure 2 | STOMAGEN overexpression on stomatal development in tmm
hypocotyl epidermis with combinatorial loss-of-function in ER-family
genes. a-h, Representative confocal microscopy images of hypocotyl
epidermis from 10-day-old light-grown transgenic Est::STOMAGEN
(oestradiol-induced STOMAGEN) seedlings of tmm (a, b); tmm er (c, d); tmm
erl1 erl2 (e, f); and tmm er erl1 erl2 (g, h). A control, uninduced phenotype
(a, c, e, g) and iSTOMAGEN phenotype (b, d, f, h) is shown. iSTOMAGEN
results in arrested stomatal precursor cells (asterisk) and stomatal-lineage
ground cells (SLGCs (bracket)) in tmm hypocotyls (b). Additional er mutation
exaggerated this effect (d), while additional erl1 erl2 mutations increased
stomata (f). Images were taken under the same magnification. Scale bar, 30 μm.
n = 20 (a); n = 20 (b); n = 19 (c); n = 22 (d); n = 17 (e); n = 20 (f); n = 20
(g); n = 20 (h). For a complete set of higher-order mutant phenotypes and
quantitative data, see Extended Data Fig. 5.

Among the ER family, ER primarily perceives EPF2 to restrict
initiation of stomatal cell lineages, while ERL1 primarily perceives
EPF1 to orient stomatal spacing and prevent guard cell differentiation.
As such, epf2 increases SLGCs, whereas epf1 violates stomatal
spacing. Neither epf2 nor epf1 confers a severe stomatal clustering
phenotype like that of iSTOMAGEN, since only a subset of ER-family-mediated
pathways has been compromised. We delineated the role
of Stomagen in each of these steps. We first examined whether EPF1,
EPF2 and STOMAGEN transcripts are under feedback regulation,
which may complicate the genetic analyses. EPF1 and EPF2 transcript
levels were slightly upregulated by iSTOMAGEN, and conversely,
slightly downregulated by STOMAGEN-ami (Extended Data Fig.
2c, d). However, the endogenous STOMAGEN transcript levels are
unaffected by epf1, epf2, or epf1 epf2 (Extended Data Fig. 2d). Thus,
altered expression of EPF1 and EPF2 by STOMAGEN misregulation
probably reflects the numbers of stomatal-lineage cells.

EPF2-ER or EPF1-ERL1 signalling pathways with iSTOMAGEN 
resulted in severe stomatal clusters, indicating that excessive 
Stomagen promotes stomatal differentiation when either pathway is 
compromised (Extended Data Fig. 3). These genetic data support 
the notion that Stomagen, when ectopically overexpressed, can 
bind to all ER-family receptor kinases and inhibit signal transduc- 
tion. Indeed, co-immunoprecipitation experiments using Nicotiana 
benthamiana microsomal fraction expressing green fluorescent pro- 
tein (GFP)-fused ectodomains of ER, ERL1, ERL2 or TMM incubated 
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Figure 3 | Direct and competitive binding of Stomagen and EPF2 peptides 
to ER. a—c, QCM analysis for direct binding. a, b, The averages of experimental 
frequency shift values recorded from two to four independent experiments 
for Stomagen (a) or mEPF2 (b) onto biosensor chips functionalized with 
ER(AK)-GFP (red), TMM-GFP (blue) and GFP alone (grey) and fitted to the 
Langmuir adsorption model using least square regression. Error bars indicate 
s.d. Stomagen-ER, n = 3; Stomagen-TMM, n = 2; Stomagen-GFP, n = 3; 
mEPF2-ER, n = 2; mEPF2-TMM, n = 3; mEPF2-GFP, n = 2.c¢, The average 
experimental frequency shift values recorded for LURE2 (dark grey) and 
mutant Stomagen (light grey) on ER(AK)-GFP. To calculate the dissociation 
constant (Kg) values, the ligand concentrations were increased to 1 [1M to 
obtain fitted curves. See Extended Data Fig. 9 for raw recording data. Inactive 
mutant Stomagen, in which six cysteines were substituted with serines 
(Stomagen_C6—S)-ER, n = 3; LURE2-ER, n = 2. Error bars indicate s.d. 
Right insets: wild-type cotyledon epidermis treated with 2.5 uM mutant or 


with synthetic Stomagen peptides demonstrated that Stomagen 
associates with all ER-family receptor kinases and TMM (Extended 
Data Fig. 6a). 

Unlike overexpression, Stomagen co-suppression imposed differ- 
ent effects on EPF2-ER and EPFI-ERLI signalling pathways. 
STOMAGEN-ami suppressed the stomatal-pairing phenotype of 
epfl and dominant-negative, kinases-deleted ERL1 (ERL1(AK)) erll 
(Extended Data Fig. 4g-j, m). In contrast, STOMAGEN-ami exhibited 
complex interactions with epf2 and dominant-negative ER (ER(AK)) 
er, reducing numbers of stomata but not that of SLGCs (Extended 
Data Fig. 4c—f, k-n). This supports the idea that Stomagen counteracts 
EPF2 for ER-mediated stomatal initiation’*'*’®. This also suggests 
that, in the absence of both EPF2 and STOMAGEN, the default ER 
pathway is not activated while the later ERL-mediated pathway 
remains capable of repressing the differentiation of mature stomata. 


Competitive binding of EPF2 and Stomagen 

A series of genetic analyses leads to the possibility that Stomagen 
antagonizes EPF2’s action via direct binding to ER. To address this, 
we produced bioactive Stomagen and predicted mature EPF2 
(mEPF2) peptides (Extended Data Figs 7 and 8). Subsequently, their 
direct binding to ER as well as to TMM was tested using previously 
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bioactive Stomagen. Scale bars, 30 1m. n = 8 for each treatment. For a—c, each 
experimental point in independent binding experiment (referred to as ‘n = 1’) 
is generated from multi-point raw data (10-20 points) with average and s.d. 
values. d, Competitive binding. Microsomal fractions expressing ER(AK)-GFP 
were incubated with 1 1M of bioactive mEPF2 with increasing concentrations 
of bioactive Stomagen and subjected to immunoprecipitation. The mEPF2— 
MYC-His blot was re-probed with anti-Stomagen antibody. Asterisk indicates 
most likely isomer. e, Quantitative analysis of competition from four 
biological replicates. Error bars, s.e.m. The ICs9 value is substantially higher 
than the Kg values for Stomagen-ER or mEPF2-ER, presumably owing to the 
immunoblot-based quantification. f, Wild-type cotyledon epidermis treated 
with mEPF2 alone or simultaneously co-treated with mEPF2 and increasing 
concentrations of Stomagen for 5 days. n = 3 for each treatment. Images were 
taken under the same magnification. Scale bar, 50 UM. 


established quartz crystal microbalance (QCM) biosensor platforms 
(Fig. 3a, b and Extended Data Fig. 9)°. Briefly, we immobilized puri- 
fied GFP-fused receptors or control GFP from N. benthamiana on 
gold surfaces of QCM chips via anti-GFP antibody and then intro- 
duced the bioactive Stomagen or mEPF2 peptide solutions. The 
peptide-receptor binding was recorded as a function of frequency 
change (see Methods)°. Both Stomagen and mEPF2 exhibited saturable 
binding to the ER ectodomain fused to GFP (ER(ΔK)-GFP) 
with similar dissociation constants at a nanomolar range (Fig. 3a, b 
and Extended Data Fig. 9). Additionally, Stomagen and mEPF2 
bound to TMM with high affinity (Fig. 3a, b). No significant binding 
of Stomagen or mEPF2 to control GFP was detected (Fig. 3a, b and 
Extended Data Fig. 9). To address the specificity of peptide-receptor 
interactions, two control peptides were subjected to the QCM analysis 
using ER(ΔK)-GFP-functionalized chips: (1) non-folding, inactive 
mutant Stomagen, in which six cysteines were substituted with serines 
(Extended Data Fig. 8g)'®; and (2) LURE2, an unrelated cysteine-rich 
peptide, which acts as a pollen-tube attractant”. Neither mutant 
Stomagen nor LURE2 exhibited binding above the background level 
(Fig. 3c). Consistently, LURE2 did not associate with ER, TMM, or an 
innate immunity receptor FLS2 (refs 21, 22) fused with GFP expressed 
in N. benthamiana in co-immunoprecipitation assays (Extended Data 

Figure 4 | EPF2, but not Stomagen, triggers downstream MAPK activation 
in Arabidopsis seedlings. a, b, Differential MAPK activation in Arabidopsis 
wild-type seedlings treated with buffer only (a, mock), mEPF2 (a, b), Stomagen 
(a), and heat-denatured mEPF2 (b) for respective time intervals (min). The 
blots were probed with anti-phospho-ERK antibody (anti-pERK) to detect 
phosphorylated MPK6 (pMPK6) and pMPK3 upon peptide treatment. 
Asterisk indicates non-specific band. CBB, total proteins stained. Four and two 
biological replicates were performed for a and b, respectively. c, Confocal 
microscopy of Arabidopsis wild-type cotyledon abaxial epidermis treated with 
heat-denatured mEPF2 (top) and control, non-denatured mEPF2 (bottom). 
Scale bar, 40 μm. n = 3 for each treatment. 


Fig. 6b). Similarly, FLS2 failed to immunoprecipitate Stomagen above 
background levels (Extended Data Fig. 6c). Together, the results 
emphasize the specificity of Stomagen-ERECTA and Stomagen- 
TMM interactions. 

Next, we performed ligand competition assays between Stomagen 
and EPF2. Microsomal fractions from N. benthamiana expressing 
ER ectodomain (ER(ΔK)-GFP) were incubated with bioactive epitope-tagged 
mEPF2 (mEPF2-MYC-His; 1 μM) and increasing 
concentrations of bioactive Stomagen peptides (0-23.4 μM) followed 
by immunoprecipitation of ER. Co-immunoprecipitated epitope- 
tagged mEPF2 was detected first. Then, the same blot was re-probed 
with anti-Stomagen antibody to detect co-immunoprecipitated 
Stomagen. Increasing concentrations of Stomagen peptide replaced 
mEPF2 for ER binding (Fig. 3d). Quantitative analysis confirmed 
the competitive binding of Stomagen and mEPF2 to ER, with a 
half-maximum inhibitory concentration (IC50) value of 454 nM 
(Fig. 3e). Our results demonstrate that Stomagen and EPF2 peptides 
directly compete for binding to the same receptor, ER. Application of 
mEPF2 to wild-type seedlings inhibited stomatal development, while 
simultaneous treatment of mEPF2 with increasing concentrations of 
Stomagen in a concentration range similar to that used in the competition 
experiments resulted in increased stomatal differentiation (Fig. 3f). 
The results are consistent with a previous report'® and further 
emphasize the in vivo biological relevance of peptide competition. 


Activation of downstream signalling 


To unravel the mechanism of Stomagen as a competitive antagonist of 
EPF2, we examined the activation of downstream signalling, specifically, 
using phosphorylation of MPK3 and MPK6 as a readout. Genetic 
studies suggest that EPF2-ER ligand-receptor signalling acts via a 
MAPK cascade*"”. However, a recent report of co-expressed stomatal 
signalling components in N. benthamiana failed to detect MPK6 
activation by EPF2 (ref. 17), probably due to a limitation of the heterologous 
co-expression system for capturing fast and transient responses. 
We therefore tested MAPK activation in vivo using 
Arabidopsis seedlings. Application of mEPF2 peptide to Arabidopsis 
wild-type seedlings rapidly elicited phosphorylation of MPK3 and 
MPK6 in 10 min, a characteristic signature of MAPK activation, 
which declined after 2 h (Fig. 4a, b). Heat-induced denaturation of 
mEPF2 greatly diminished MAPK phosphorylation, correlating with 
its loss of bioactivity (Fig. 4b, c). By contrast, Stomagen peptide treat- 
ment failed to trigger MAPK phosphorylation (Fig. 4a). We conclude 
that EPF2 activates ER signalling, leading to subsequent MAPK 
activation to inhibit stomatal development, while Stomagen prevents 
the signal transduction. 


Discussion 


Our work elucidates the competitive binding of Stomagen and EPF2 
to ER as a molecular mechanism optimizing stomatal patterning. 
Plant genomes possess large numbers of peptide gene families, the 
functions of which remain largely unknown”. The concept of fine- 
tuning signal transduction by related endogenous peptides that 
assume opposing functions may extend to other peptide families. 
EPF2 is expressed in a subset of protodermal cells, while Stomagen 
is secreted from an underlying internal tissue**'*'*. Thus, it seems 
plausible that a protodermal cell might respond to differences in 
intrinsic concentrations of EPF2 and Stomagen on each neighbouring 
side. It remains to be tested whether local concentrations of Stomagen 
in the apoplast reflect the IC50 values we have determined biochemi- 
cally (Fig. 3e). The complex effects of STOMAGEN overexpression on 
a series of er-family mutants in the tmm background (Fig. 2) resemble 
that of challah (chal) higher-order mutants, which lack EPFL4 
and EPFL6 peptides, another set of ER ligands promoting stem 
growth***, This raises the possibility that complex fine-tuning of 
multiple EPF-family peptides may occur at developmental contexts 
far beyond stomatal patterning. Quantitative visualization of each 
peptide in vivo during epidermal development, as well as precise 
documentation of the dose-response effects of simultaneous mixed 
peptide applications of wide concentration gradients, may reveal the 
signalling complexity at the level of ligand-receptor association. EPF2 
and Stomagen bind to ER and TMM with a similar affinity (Fig. 3), 
suggesting the formation of co-receptor complexes, a hallmark of 
receptor activation and signal transduction in plant LRR-RKs in 
development and innate immunity response*””*. Future structural 
and cell biological studies may reveal the intricacy behind how a cell 
interprets conflicting signals to make decisions during developmental 
patterning. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS 


No statistical methods were used to predetermine sample size. 

Plant materials and growth conditions. The Arabidopsis accession Columbia 
(Col) was used as wild type. All plants used in this study are in the Col back- 
ground. The following mutants and transgenic plant lines were reported prev- 
iously: er-105, erl1-2, erl2-1 and their higher order mutants’; tmm-1 (ref. 8); epf1, 
epf2, epf1 epf2 and tmm-KO (refs 3, 4); ER(ΔK) in er-105, ERL1(ΔK) in erl1-2 (ref. 6); 
and STOMAGEN-ami line (line 12; a gift from I. Hara-Nishimura)’*. The 
STOMAGEN-ami lines were introduced into various mutant or transgenic back- 
grounds via genetic crosses. Plants were grown as described previously”. For 
phenotypic analysis, seedlings were grown on 0.5× Murashige and Skoog (MS) 
media containing 1× Gamborg Vitamin (Sigma), 0.75% Bacto Agar, and 1% w/v 
sucrose under continuous light condition at 50 μmol s⁻¹ m⁻² (light intensity 
measured by LI-250A; LI-COR). 

Plasmid construction and transgenic plants generation. pKUT608 (pENTR- 
STOMAGEN) and pKMP127 (Est::STOMAGEN) were generated. See Extended 
Data Table 1 for plasmid and primer sequence information. Transgenic 
Arabidopsis plants were generated by the floral dip method*’. Multiple transgenic 
lines per construct were subjected to phenotypic characterization and represent- 
ative lines (three lines if lines were established, and 12-14 lines if T1 lines were 
used) were used for quantitative analyses. The Est::STOMAGEN lines were intro- 
duced into various mutants or transgenic backgrounds via genetic crosses. 

Chemical induction of transgene. Transgenic Arabidopsis seedlings carrying 
Est::STOMAGEN were germinated on 0.5× MS medium supplemented with 10 μM 
oestradiol (Sigma). Induction of STOMAGEN gene expression (iSTOMAGEN) was 
confirmed by RT-PCR (see Extended Data Fig. 2). The induction was further 
confirmed by observing the epidermal phenotypes of cotyledons and hypocotyls 
using a confocal microscope. 

RT-PCR analysis. RNA extraction, cDNA synthesis and RT-PCR were per- 
formed as previously described*’. For a list of primers, see Extended Data Table 1. 
Histology, microscopy and image analysis. Confocal microscopy images were 
taken using either Zeiss LSM700 operated by Zen2009 (Zeiss) described 
previously® or Leica SP5-WLL operated by LAS AF (Leica). Cell outlines were 
visualized with either propidium iodide (PI: Molecular Probes) or FM4-64 
(Invitrogen) and observed using the HyD detector with excitation 515 nm, emis- 
sion 623-642 nm. The images were false coloured using Photoshop CS6 (Adobe). 
Clearing of seedlings by chloral hydrate and observation using differential inter- 
ference contrast (DIC) microscope was performed as described previously”. For 
histological analysis, seedlings were stained with toluidine blue-O (TBO: Sigma) 
as follows. Briefly, samples were placed in 9:1 v/v ethanol to acetic acid overnight, 
rehydrated through reduced ethanol series to deionized water, then stained with 
0.5% TBO for 3 min. Seedlings were immediately rinsed with deionized water and 
subsequently mounted in 15% v/v glycerol. For bright-field and DIC microscopy, 
images were taken under Olympus BX51 equipped with DP73 digital camera 
operated by CellSens Standard software (Olympus). 

Quantitative analysis of epidermis. Abaxial cotyledons from 10-day-old seedlings 
of the respective genotypes were subjected to TBO staining or DIC microscopy. 
The central regions overlying the distal vascular loop were imaged and numbers 
of epidermal cells, stomata and their cluster size were quantified. For each geno- 
type, sample size of 14-16 was used and over 1,000 epidermal cells were counted 
to provide statistical robustness. For cotyledons of Est::STOMAGEN lines, indi- 
vidual T1 seedlings were subjected to analysis. For hypocotyls, three represent- 
ative T2 Est::STOMAGEN lines were analysed. For each seedling, a representative 
image was taken at the exact location to minimize the variance. Specific numbers 
of stomata are listed for each genotype in corresponding figure legends. Statistical 
analysis (Wilcoxon rank sum test) was performed using R version 3.0.3 for 
stomatal density, stomatal index and SLGC index. P values are indicated in each 
figure legend. 
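
The quantities named in this paragraph can be illustrated with a short Python sketch. It is not the authors' R code: the function names and the example counts are hypothetical, the imaged area is an assumed input, and the stomatal index follows the definition given in the Extended Data legends (stomata as a percentage of stomata plus non-stomatal epidermal cells). The last function returns only the rank-sum statistic (the value reported as W by the Wilcoxon rank sum test in R); the P value would still come from a statistics package.

```python
def stomatal_index(n_stomata, n_other_epidermal):
    """Stomatal index in %: stomata / (stomata + non-stomatal epidermal cells)."""
    total = n_stomata + n_other_epidermal
    if total == 0:
        raise ValueError("no cells counted")
    return 100.0 * n_stomata / total


def stomatal_density(n_stomata, area_mm2):
    """Stomatal density as number of stomata per mm^2 of imaged epidermis."""
    if area_mm2 <= 0:
        raise ValueError("area must be positive")
    return n_stomata / area_mm2


def rank_sum_w(x, y):
    """Rank-sum statistic W for sample x against sample y, using midranks for ties."""
    pooled = sorted(list(x) + list(y))
    midrank = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Tied values occupy ranks i+1 .. j; assign their average.
        midrank[pooled[i]] = (i + 1 + j) / 2.0
        i = j
    n_x = len(x)
    return sum(midrank[v] for v in x) - n_x * (n_x + 1) / 2.0


# Hypothetical per-seedling stomatal indices for two genotypes (illustration only).
control = [stomatal_index(s, o) for s, o in [(40, 110), (38, 120), (45, 105)]]
induced = [stomatal_index(s, o) for s, o in [(80, 90), (95, 85), (88, 92)]]
print(rank_sum_w(control, induced))
```

A W of zero, as in this made-up example, means that every control value ranks below every induced value.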

Transient protein expression in Nicotiana benthamiana. Agrobacterium tume- 
faciens strain GV3101 was transformed with expression clones and grown in yeast 
extract and beef medium supplemented with relevant antibiotics. Bacterial cultures 
were precipitated and resuspended in infiltration medium (10 mM MgCl2, 
10 mM MES (pH 5.6) and 150 μM acetosyringone). Culture densities were 
adjusted to an OD600 of 1.0, and the cells were incubated at room temperature 
for 4 h before infiltration. Equal volumes of cultures carrying different constructs 
were mixed. To enhance transient expression in tobacco, the silencing suppressor 
p19 (a gift from D. Baulcombe) was co-infiltrated*’. The bacterial suspensions 
were infiltrated into young but fully expanded leaves of N. benthamiana plants. 
After infiltration, plants were cultivated at 25°C and collected for further bio- 
chemical assays after 48-72 h. 

Peptide expression, purification and refolding. Recombinant mEPF2 peptide 
was prepared as reported previously® and the mature Stomagen peptide, either 


wild-type or non-refolding mutant version in which all six cysteines were sub- 
stituted by serines, were chemically synthesized (Invitrogen and BioSynthesis). 
The Stomagen peptide was dissolved in 20 mM Tris-HCl, pH 8.8, and 50 mM 
NaCl and refolded (Mini dialysis kit, MWCO:1,000, GE Healthcare) for 3 d at 
4°C using glutathione (reduced and oxidized forms; Wako) and L-arginine ethyl 
ester dihydrochloride (Sigma). The peptides were further dialysed three times 
against 50 mM Tris-HCl, pH 8.0 for 1.5 d to remove glutathione. For non-folding 
mutant Stomagen, chemically synthesized peptides were dissolved in 50 mM 
Tris-HCl, pH 8.0. The resulting mEPF2, Stomagen and mutant Stomagen pep- 
tides were fractionated using C18 column (Gemini) by HPLC (Waters Delta 
Prep 3000 HPLC) as previously described to determine the purity of each pep- 
tide’. The separated peaks were collected, and each peak was identified by 
MALDI-TOF mass spectrometry on an Autoflex II mass spectrometer in positive 
ion mode (Bruker Daltonics) using 2:1 α-cyano-4-hydroxycinnamic acid and 
2,5-dihydroxy-benzoic acid matrix. The collected HPLC-purified mEPF2 and 
Stomagen peptide peaks were freeze-dried, then re-dissolved to appropriate con- 
centration. Quantification of the active populations of peptides was determined 
using NanoDrop8000 (Thermo Scientific) using the following molar extinction 
coefficients: Stomagen, 5,960; EPF2, 6,460; LURE2, 23,950 mol⁻¹ cm⁻¹. For 
bioassays, freeze-dried peptides were re-dissolved to appropriate concentration 
in MS medium. For subsequent biochemical assays, the amounts of bioactive 
peptides were calculated from this quantification. 
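
As a worked illustration of this quantification step, the sketch below converts an absorbance reading into a peptide concentration with the Beer-Lambert law, c = A/(εl), using the extinction coefficients listed above. Several points are assumptions rather than statements from the text: the coefficients are treated as molar values in M⁻¹ cm⁻¹, the absorbance is taken to be normalized to a 1 cm path length, and the function name and example reading are hypothetical.

```python
# Extinction coefficients as listed in the Methods (assumed units: M^-1 cm^-1).
EXTINCTION = {"Stomagen": 5960.0, "EPF2": 6460.0, "LURE2": 23950.0}


def peptide_conc_um(absorbance, peptide, path_cm=1.0):
    """Beer-Lambert law: concentration (in micromolar) = A / (epsilon * path length)."""
    if path_cm <= 0:
        raise ValueError("path length must be positive")
    molar = absorbance / (EXTINCTION[peptide] * path_cm)
    return molar * 1e6


# Hypothetical absorbance reading for a refolded Stomagen stock.
print(round(peptide_conc_um(0.0596, "Stomagen"), 3))
```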

Peptide bioassays. Refolded recombinant mEPF2 and Stomagen peptides in 
buffer were applied to 1-day-old Arabidopsis plants that had germinated on 
0.5× MS medium. After 5 d of further incubation in 0.5× MS liquid medium containing 
each peptide (2.5-5 μM concentration), stomatal phenotypes of abaxial cotyledon 
epidermis were determined by inspection with a confocal microscope as 
described previously®. 

Immunoprecipitation, protein gel electrophoresis and immunoblots. For 
immunoprecipitation and co-immunoprecipitation assays, N. benthamiana 
leaves expressing CaMV35S::ER(ΔK)-GFP, CaMV35S::FLS2(ΔK)-GFP, 
CaMV35S::TMM-GFP, CaMV35S::GFP, or empty vector were subjected to protein preparation 
(microsomal fraction enrichment for all except soluble GFP). Co-incubation 
with Stomagen (1 μM) or LURE2 peptides (1 μM) and immunoprecipitation 
procedure are described in the Ligand Competition Assays section below. 
Immunoprecipitation using anti-GFP (Abcam ab290) antibodies and protein 
gel immunoblot (western blot) analysis using anti-GFP (Invitrogen C163), 
anti-Flag (Sigma-Aldrich M2), anti-His (Qiagen anti-His5 34660), and anti- 
Stomagen (a gift from I. Hara-Nishimura)'* antibodies were performed as 
described previously®. As secondary antibodies, either goat anti-mouse (GE 
Healthcare NA931) or anti-rabbit IgG horseradish peroxidase-linked antibodies 
(Sigma A6154) were used at a dilution of 1:50,000. The protein blots were visua- 
lized using SuperSignal West Femto Maximum Sensitivity Substrate (Thermo 
Scientific). 

Quartz crystal microbalance. QCM measurements were performed using QCM- 
Z500 (KSV Instruments) and commercially available AT-cut polished QCM 
crystals with a fundamental resonant frequency of 4.95 MHz (International 
Crystal Manufacturing Co.) as reported previously®. The QCM crystal chips were 
pre-treated with anti-GFP antibody (Abcam ab290) to functionalize the chip 
surface. Subsequently, GFP-tagged receptors or GFP expressed in N. benthamiana 
and extracted as a microsomal fraction (for ER(ΔK)-GFP and TMM-GFP) 
or a total fraction (GFP) were immobilized onto a QCM sensor chip via anti-GFP 
antibody linkage. The chips were washed with a phosphate buffer extensively. 
After establishing a stable baseline using phosphate buffer solution, purified 
bioactive mEPF2 or Stomagen peptides in the phosphate buffer were added stepwise 
to the QCM chamber. The frequency change for QCM was monitored until 
no further change was observed, indicating equilibrium. All experiments were 
performed at 4°C in stop-flow mode. The peptide-receptor binding was quantified 
via QCM by measuring the frequency shifts, $\Delta F$, at several peptide concentrations. 
To determine the dissociation constant ($K_\mathrm{d}$) of each peptide-receptor 
pair, the experimental frequency shift values were fitted to the Langmuir adsorption 
model: $-\Delta F = \Delta F_{\max}\,C/(C + K_\mathrm{d})$, where $\Delta F_{\max}$ is the frequency shift when 
the binding is saturated and $C$ is the concentration of the bulk solution, using a 
least squares regression. 
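
The fitting step described above can be sketched in Python as follows. This is a minimal illustration, not the procedure actually used for the QCM data: the function names, the search bounds and the synthetic data are all hypothetical. It exploits the fact that, for a fixed dissociation constant, the saturating frequency shift enters the Langmuir model linearly and has a closed-form least-squares value, so only the dissociation constant needs a one-dimensional search (a coarse scan on a log scale followed by golden-section refinement).

```python
import math


def _profile(conc, y, kd):
    """For a fixed Kd, return the least-squares dF_max and the residual sum of squares."""
    g = [c / (c + kd) for c in conc]
    f_max = sum(yi * gi for yi, gi in zip(y, g)) / sum(gi * gi for gi in g)
    sse = sum((yi - f_max * gi) ** 2 for yi, gi in zip(y, g))
    return f_max, sse


def fit_langmuir(conc, neg_delta_f, kd_lo=1e-3, kd_hi=1e6, grid=400):
    """Fit -dF = dF_max * C / (C + Kd); returns (Kd, dF_max) in the units of the inputs.

    Assumes positive concentrations and a Kd inside [kd_lo, kd_hi] (hypothetical bounds).
    """
    def sse(log_kd):
        return _profile(conc, neg_delta_f, 10.0 ** log_kd)[1]

    lo, hi = math.log10(kd_lo), math.log10(kd_hi)
    logs = [lo + k * (hi - lo) / (grid - 1) for k in range(grid)]
    best = min(range(grid), key=lambda k: sse(logs[k]))
    a, b = logs[max(best - 1, 0)], logs[min(best + 1, grid - 1)]

    # Golden-section search on log10(Kd) inside the bracket found by the coarse scan.
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = sse(c), sse(d)
    for _ in range(80):
        if fc < fd:
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = sse(c)
        else:
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = sse(d)
    kd = 10.0 ** ((a + b) / 2.0)
    return kd, _profile(conc, neg_delta_f, kd)[0]


# Synthetic, noise-free data (not measurements): Kd = 50, dF_max = 120.
conc_nm = [5, 10, 25, 50, 100, 250, 500]
shift_hz = [120.0 * c / (c + 50.0) for c in conc_nm]
kd_fit, f_max_fit = fit_langmuir(conc_nm, shift_hz)
print(round(kd_fit, 3), round(f_max_fit, 3))
```

With real, noisy frequency shifts the same routine applies, although confidence intervals on the fitted constants would need to be estimated separately.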

Ligand competition assays. N. benthamiana leaves expressing CaMV35S:: 
ER(AK)-GFP were ground in liquid nitrogen and homogenized in extraction 
buffer (100 mM Tris-HCl pH 8.8, 150 mM NaCl, 1 mM EDTA, 20% glycerol, 
20 mM NaF, 1 mM PMSF, 1:1,000 Complete protease inhibitor cocktail (Roche 
Applied Science)). The slurry was centrifuged at 10,000g for 15 min at 4 °C. The 
supernatant was sonicated on ice and then centrifuged at 100,000g for 30 min at 
4°C to give a pellet of the microsomal fractions. The pellet was resuspended 
in membrane solubilization buffer (100 mM Tris-HCl at pH 7.3, 150 mM 
NaCl, 1 mM EDTA, 10% glycerol, 1% Triton X-100, 20 mM NaF, 1 mM 
PMSF, 1:1,000 Complete protease inhibitor cocktail) to release membrane proteins. 
The solution was sonicated on ice and centrifuged again at 100,000g for 30 
min at 4°C. The supernatant was incubated with Protein-G-coupled magnetic 
beads (Dynabeads Protein G, Invitrogen) that captured anti-GFP (ab290; Abcam) 
antibody at 4 °C for 2 h with gentle agitation. Then, the beads were washed four 
times with 500 μl of phosphate buffer (pH 7.4). The immunoprecipitates were 
suspended in 500 μl of binding buffer (50 mM MES-KOH, pH 5.5 with 100 mM 
sucrose) containing 1 μM mEPF2-MYC-His peptide in the absence or presence 
of different concentrations of unlabelled bioactive Stomagen peptide and then 
incubated at 4 °C for 1 h with gentle agitation. The reaction mixture was washed 
four times with 500 μl of phosphate buffer (pH 7.4) to separate bound and free 
mEPF2-MYC-His peptide, and precipitated proteins were eluted with 2× SDS 
sample buffer at 80 °C for 5 min. Either total membrane or immunoprecipitated 
proteins were separated on an SDS-PAGE gel and transferred to PVDF membrane 
(Millipore) for immunoblot analysis using monoclonal anti-GFP (C163, 1:1,000, 
Invitrogen), anti-MYC (ab32, 1:1,000, Abcam) or anti-Stomagen antibodies (1: 
5,000, a gift from I. Hara-Nishimura)'* as primary antibodies. As secondary 
antibodies, either goat anti-mouse or anti-rabbit IgG horseradish peroxidase-linked 
antibodies (GE Healthcare NA931; Sigma A6154) were used at a dilution of 
1:50,000. Co-immunoprecipitated mEPF2 was detected first. Then, the same blot 
was re-probed with anti-Stomagen antibody to detect Stomagen. 

Four biological replicates were performed and subjected to quantification of 
the IC50 values as follows. Band intensities on western blots were quantified 
using ImageJ (http://rsb.info.nih.gov/ij/index.html). Pixel values were measured 
on equal-sized areas and normalized against the bands detecting the 
same immunoprecipitates by monoclonal anti-GFP antibody. The intensity 
values shown in the paper are the ratios relative to the references, and values 
were analysed by nonlinear regression analysis using OriginLab version 6 
(OriginLab) to calculate the IC50 value. 
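
A nonlinear regression of this kind can be sketched in Python as below. The text does not state which model was fitted, so the one-site competition curve used here (relative signal = 1/(1 + [Stomagen]/IC50), with the competitor-free lane defined as 1) is an assumption, as are the function names, the search bounds and the synthetic data; it is an illustration of the idea rather than a reproduction of the OriginLab analysis.

```python
import math


def bound_fraction(competitor, ic50):
    """Assumed one-site competition model: signal relative to the competitor-free lane."""
    return 1.0 / (1.0 + competitor / ic50)


def fit_ic50(competitor, rel_signal, lo=1e-4, hi=1e4, grid=400):
    """Least-squares IC50 (same units as `competitor`), searched on a log scale."""
    def sse(log_ic50):
        ic50 = 10.0 ** log_ic50
        return sum((y - bound_fraction(s, ic50)) ** 2
                   for s, y in zip(competitor, rel_signal))

    log_lo, log_hi = math.log10(lo), math.log10(hi)
    logs = [log_lo + k * (log_hi - log_lo) / (grid - 1) for k in range(grid)]
    best = min(range(grid), key=lambda k: sse(logs[k]))
    a, b = logs[max(best - 1, 0)], logs[min(best + 1, grid - 1)]

    # Golden-section refinement inside the bracket around the best grid point.
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    fc, fd = sse(c), sse(d)
    for _ in range(80):
        if fc < fd:
            b, d, fd = d, c, fc
            c = b - inv_phi * (b - a)
            fc = sse(c)
        else:
            a, c, fc = c, d, fd
            d = a + inv_phi * (b - a)
            fd = sse(d)
    return 10.0 ** ((a + b) / 2.0)


# Synthetic, noise-free ratios (not the blot data), generated with IC50 = 0.454 uM.
stomagen_um = [0.0, 0.1, 0.3, 1.0, 3.0, 10.0, 23.4]
ratios = [bound_fraction(s, 0.454) for s in stomagen_um]
print(round(fit_ic50(stomagen_um, ratios), 4))
```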

MAPK phosphorylation assays. 12-day-old Arabidopsis seedlings were grown 
for 5 days on 0.5× MS media plates and then transferred to 0.5× MS liquid media in a 
12-well cluster plate (Falcon 3047). Seedlings were treated with buffer only, 
mEPF2 (2.5 μM), or with Stomagen (5 μM) at room temperature before being 
pooled for harvest. For heat denaturation of mEPF2, the peptide solution was 
treated at 95 °C for 2 h before MAPK phosphorylation assays and bioassays. Plant 
materials were ground in liquid nitrogen, and then extracted with buffer (100 mM 
HEPES, pH 7.5, 5 mM EDTA, 5 mM EGTA, 2 mM DTT, 10 mM Na3VO4, 10 mM 
NaF, 50 mM β-glycerolphosphate, 1 mM PMSF, 1 tablet per 50 ml extraction 
buffer of proteinase inhibitor mixture, 10% glycerol, 7.5% (w/v) PVPP). After 
centrifugation at 13,000 r.p.m. for 30 min, the protein concentration was deter- 
mined using a Bradford assay (Bio-Rad). Immunoblot analysis was performed 
using anti-phospho-ERK (1:2,000, Cell Signaling) antibody as primary antibody, 
and peroxidase-conjugated goat anti-rabbit IgG (1:15,000, Sigma) as secondary 
antibody. 


29. Shpak, E. D., Lakeman, M. B. & Torii, K. U. Dominant-negative receptor uncovers 
redundancy in the Arabidopsis ERECTA leucine-rich repeat receptor-like kinase 
signaling pathway that regulates organ shape. Plant Cell 15, 1095-1110 (2003). 

30. Clough, S. J. & Bent, A. F. Floral dip: a simplified method for Agrobacterium- 
mediated transformation of Arabidopsis thaliana. Plant J. 16, 735-743 (1998). 

31. Pillitteri, L. J., Peterson, K. M., Horst, R. J. & Torii, K. U. Molecular profiling of stomatal 
meristemoids reveals new component of asymmetric cell division and 
commonalities among stem cell populations in Arabidopsis. Plant Cell 23, 
3260-3275 (2011). 

32. Pillitteri, L. J., Sloan, D. B., Bogenschutz, N. L. & Torii, K. U. Termination of 
asymmetric cell division and differentiation of stomata. Nature 445, 501-505 
(2007). 

33. Voinnet, O., Rivas, S., Mestre, P. & Baulcombe, D. An enhanced transient expression 
system in plants based on suppression of gene silencing by the p19 protein of 
tomato bushy stunt virus. Plant J. 33, 949-956 (2003). 


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 1 | Stomatal clustering phenotype of induced 
STOMAGEN overexpression in multiple independent transgenic lines. 
Shown are confocal microscopy images of abaxial cotyledon epidermis from 
10-day-old light-grown seedlings of four independent transgenic lines 
carrying an oestradiol-inducible STOMAGEN overexpression construct 
(iSTOMAGEN). Left panels, no induction (control); right panels, oestradiol 
induction; each row shows representative images from individual lines. Yellow 
brackets indicate stomatal clusters. Images are taken under the same 
magnification. Scale bar, 40 μm. n = 3 for each panel. 

Extended Data Figure 2 | RT-PCR analysis of STOMAGEN transcripts in 
transgenic lines used in this study. a, Expression of oestradiol-inducible 
STOMAGEN transgene (iSTOMAGEN) in transgenic lines expressing 
oestradiol-inducible STOMAGEN overexpression (Est::STOMAGEN) lines 
from wild-type (wt), tmm and er erl1 erl2 triple mutant background with or 
without oestradiol induction. b, Expression of the endogenous STOMAGEN 
transcripts in each genotype carrying STOMAGEN-ami construct. tmm or er 
erl1 erl2 mutation does not seem to affect STOMAGEN transcript levels. 

c, Expression of EPF1, EPF2, total STOMAGEN and STOMAGEN transgene 
(iSTOMAGEN) transcripts in transgenic Est::STOMAGEN lines (in two 
different T1 populations (s1 and s2) and a representative T3 line (s3)) with or 
without oestradiol induction. STOMAGEN overexpression by oestradiol causes 
modest increase in EPF1 and EPF2 transcripts, which accords with increased 
stomatal differentiation by iSTOMAGEN. d, EPF1, EPF2 and STOMAGEN 
transcript accumulation in wild-type (wt) and single- and higher-order loss-of-function 
mutants of epf1, epf2 and stomagen (STOMAGEN-ami). For epf1 
STOMAGEN-ami and epf2 STOMAGEN-ami lines, two different F3 
populations derived from the same genetic crosses were used to test the 
reproducibility. STOMAGEN expression is not influenced by epf1 and epf2 
mutations, consistent with the proto-mesophyll expression of STOMAGEN. 
However, EPF2 expression is reduced by STOMAGEN-ami, consistent with 
reduced stomatal cell lineages by STOMAGEN co-suppression. As reported, 
epf1 has a T-DNA insertion within the 5’ UTR’, which results in accumulation 
of aberrant transcripts. For all experiments, eIF4A was used as a control. For 
primer sequences see Extended Data Table 1. 



Extended Data Figure 3 | STOMAGEN overexpression promotes stomatal 
differentiation in genetic backgrounds missing/blocking EPF2-ER and 
EPF1-ERL1 signalling components. a-j, Representative confocal images of 
cotyledon abaxial epidermis from 10-day-old light-grown transgenic seedlings 
of the following genotypes, each carrying Est::STOMAGEN construct: epf2 
(a, b); dominant-negative ER (ER(ΔK)) in er (c, d); epf1 (e, f); dominant-negative 
ERL1 (ERL1(ΔK)) in erl1 (g, h); er erl1 erl2 (i, j). For each genotype, a 
control uninduced phenotype (a, c, e, g, i) and induced STOMAGEN 
overexpression (iSTOMAGEN) (b, d, f, h, j) are shown. Blocking ER or lacking 
EPF2 produces small stomatal-lineage cells due to excessive entry divisions 
(a, c; yellow brackets). iSTOMAGEN confers stomatal clusters and small 
stomatal-lineage cells are no longer present (b and d). Blocking ERL1 or lacking 
EPF1 causes a stomatal pairing due to a violation of one-cell-spacing rule 
(e, g; dots). iSTOMAGEN enhances stomatal cluster phenotype in these 
genotypes (f, h). iSTOMAGEN does not enhance stomatal clustering defects 
in er erl1 erl2 (i, j). Images were taken under the same magnification. Scale 
bars, 30 μm. n = 29 (a); n = 24 (b); n = 16 (c); n = 17 (d); n = 22 (e); n = 23 
(f); n = 17 (g); n = 20 (h); n = 24 (i); n = 24 (j). k-m, Stomatal density 
(number of stomata per mm²) (k); stomatal index (% of number of stomata per 
stomata + non-stomatal epidermal cells) (l); and stomatal cluster distribution 
(in %) (m) from 10-day-old abaxial cotyledons of transgenic lines of each 
genotype carrying Est::STOMAGEN construct. −, no induction; +, induced by 
10 μM oestradiol. Stomagen overexpression significantly increases stomatal 
density in all genotypes except for er erl1 erl2 and tmm. Error bars indicate 
s.e.m. ***P < 0.001; **P < 0.01; NS, not significant; Welch 2-sample t-test. 
Number of seedlings subjected to analysis, n = 14-16. Total numbers of 
stomata counted: wt, no induction 1,277, induction 2,639; epf1, no induction 
1,390, induction 3,485; ERL1(ΔK) erl1, no induction 1,573, induction 3,991; 
epf2, no induction 2,502, induction 3,317; ER(ΔK) er, no induction 2,899, 
induction 4,397; tmm, no induction 2,948, induction 3,212; er erl1 erl2, no 
induction 4,454, induction 4,464. All genotypes carry Est::STOMAGEN. wt, 
no induction n = 16, induction n = 14; epf1, no induction n = 16, induction 
n = 17; ERL1(ΔK) erl1, no induction n = 15, induction n = 15; epf2, no 
induction n = 15, induction n = 15; ER(ΔK) er, no induction n = 15, induction 
n = 15; tmm, no induction n = 15, induction n = 15; er erl1 erl2, no induction 
n = 15, induction n = 15. 

Extended Data Figure 4 | STOMAGEN co-suppression results in reduced 
stomatal development in genetic backgrounds missing or blocked in EPF2-ER 
and EPF1-ERL1 signalling pathways. a-j, Representative confocal images 
of cotyledon abaxial epidermis from 10-day-old light-grown transgenic 
seedlings of the following genotypes: wild type (a); STOMAGEN-ami (b); epf2 
(c); epf2 STOMAGEN-ami (d); dominant-negative ER (ER(ΔK)) in er (e); ER(ΔK) 
er STOMAGEN-ami (f); epf1 (g); epf1 STOMAGEN-ami (h); dominant-negative 
ERL1 (ERL1(ΔK)) in erl1 (i); ERL1(ΔK) erl1 STOMAGEN-ami 
(j). STOMAGEN-ami markedly reduces stomatal differentiation in wild type 
(a, b). Blocking ER or lacking EPF2 produces small stomatal-lineage cells due to 
excessive entry divisions (c, e; yellow brackets). STOMAGEN-ami exaggerates the 
small stomatal-lineage cells of epf2 (d; yellow brackets). STOMAGEN-ami 
ER(ΔK) er shows excessive asymmetric entry as well as amplifying divisions 
(f; yellow and pink brackets, respectively). Blocking ERL1 or lacking EPF1 causes 
a stomatal pairing due to a violation of one-cell-spacing rule (g, i; dots). 
STOMAGEN-ami suppresses these mild stomatal pairing phenotypes and 
reduces stomatal differentiation (h, j). Images were taken under the same 
magnification. Scale bars, 30 μm. n = 13 (a); n = 26 (b); n = 15 (c); n = 23 
(d); n = 11 (e); n = 17 (f); n = 12 (g); n = 22 (h); n = 18 (i); n = 13 (j). 
k-n, Stomatal density (k), stomatal index (l), stomatal cluster distribution 
(in %; m), and non-stomatal epidermal cell density (n) from 10-day-old abaxial 
cotyledons of each genotype with or without carrying STOMAGEN-ami 
construct. Error bars, s.e.m. ***P < 0.001; *P = 0.05; NS, not significant; Welch 
2-sample t-test. n = 9-16. Total numbers of stomata counted: wt, 719; 
STOMAGEN-ami, 204; epf1, 1,004; epf1 STOMAGEN-ami, 383; ERL1(ΔK) erl1, 
1,558; ERL1(ΔK) erl1 STOMAGEN-ami, 504; epf2, 1,505; epf2 STOMAGEN-ami, 
1,165; ER(ΔK) er, 1,361; ER(ΔK) er STOMAGEN-ami, 782; tmm, 2,495; tmm 
STOMAGEN-ami, 2,688; er erl1 erl2, 1,853; er erl1 erl2 STOMAGEN-ami, 
2,028. Total numbers of non-stomatal epidermal cells counted: wt, 1,494; 
STOMAGEN-ami, 1,299; epf1, 1,584; epf1 STOMAGEN-ami, 2,711; ERL1(ΔK) 
erl1, 871; ERL1(ΔK) erl1 STOMAGEN-ami, 1,348; epf2, 3,980; epf2 STOMAGEN-ami, 
8,808; ER(ΔK) er, 5,739; ER(ΔK) er STOMAGEN-ami, 6,939; tmm, 790; 
tmm STOMAGEN-ami, 962; er erl1 erl2, 479; er erl1 erl2 STOMAGEN-ami, 391. 
wt, n = 8; STOMAGEN-ami, n = 8; epf1, n = 9; epf1 STOMAGEN-ami, n = 17; 
ERL1(ΔK) erl1, n = 13; ERL1(ΔK) erl1 STOMAGEN-ami, n = 9; epf2, n = 11; 
epf2 STOMAGEN-ami, n = 15; ER(ΔK) er, n = 9; ER(ΔK) er STOMAGEN-ami, 
n = 11; tmm, n = 8; tmm STOMAGEN-ami, n = 8; er erl1 erl2, n = 8; er erl1 
erl2 STOMAGEN-ami, n = 8. 
Extended Data Figure 5 | STOMAGEN overexpression on stomatal
development in tmm hypocotyl epidermis with combinatorial loss-of-function
in ER-family genes: a complete set. a-r, Representative confocal
microscopy images of hypocotyl epidermis from 10-day-old light-grown
transgenic seedlings of the following genotypes, each carrying Est::STOMAGEN:
wild-type (wt) (a, b); tmm (c, d); tmm er (e, f); tmm erl2 (g, h); tmm erl1
(i, j); tmm er erl2 (k, l); tmm erl1 erl2 (m, n); tmm er erl1 (o, p); and tmm er erl1
erl2 (q, r). A control, uninduced phenotype (a, c, e, g, i, k, m, o, q); iSTOMAGEN
(b, d, f, h, j, l, n, p, r). iSTOMAGEN results in arrested stomatal precursor cells
(asterisks) and stomatal-lineage ground cells (SLGCs; brackets) in tmm
hypocotyls (d). iSTOMAGEN triggers entry divisions in tmm er and tmm erl2
(f, h; brackets), and exaggerates the SLGC clusters in tmm er erl2 (k, l; brackets).
Images were taken under the same magnification. Scale bar, 30 μm. n = 19
(h); n = 18 (i); n = 19 (j); n = 19 (k); n = 21 (l); n = 17 (m); n = 20 (n); n = 19
(o); n = 21 (p); n = 20 (q); n = 20 (r). s, t, Stomatal index and SLGC
index. s, ***P < 0.0001; **P < 0.01; *P < 0.05 (Wilcoxon rank sum test). NS,
not significant. 0, no stomata or SLGC observed; n = 15. Total number of
stomata and SLGCs counted: tmm non-induced, 0 and 0; induced, 0 and 211;
tmm er non-induced, 0 and 0; induced, 0 and 308; tmm erl2 non-induced, 0
and 32; induced, 0 and 171; tmm erl1 non-induced, 58 and 116; induced,
142 and 138; tmm er erl2 non-induced, 0 and 270; induced, 10 and 676; tmm er
erl1 non-induced, 422 and 283; induced, 817 and 422; tmm erl1 erl2 non-induced,
72 and 83; induced, 163 and 97; tmm er erl1 erl2 non-induced,
1,229 and 295; induced, 1,068 and 222. n = 15 for all genotypes (s, t).
Extended Data Figure 6 | Association of Stomagen with ER-family
receptors and TMM. a, Shown are co-immunoprecipitation assays of ligand-receptor
pairs expressed in N. benthamiana leaves. The ectodomains and
membrane-spanning domains of ER, ERL1 and ERL2 fused with GFP were
separately expressed in N. benthamiana, and microsomal fractions were
incubated with 1 μM Stomagen peptides followed by immunoprecipitation
using anti-GFP (α-GFP) antibody. Inputs and immunoprecipitates were
immunoblotted using anti-GFP (α-GFP) or anti-Stomagen (α-Stomagen)
antibodies. Experiments were repeated three times (three biological replicates).
b, Co-immunoprecipitation of LURE2 peptide fused with hexa-histidine tag
(LURE2-His) with N. benthamiana microsomal fractions expressing the
ectodomains and membrane-spanning domains of ER and FLS2 fused with
GFP, a full-length TMM fused with GFP, or a control, uninoculated leaf sample.
Immunoprecipitation was performed using anti-GFP and immunoblotted
using anti-GFP (for detection of receptors) or anti-His (for detection of
LURE2-His) antibodies. Experiments were repeated twice (two biological
replicates). c, Co-immunoprecipitation of Stomagen peptide with N.
benthamiana microsomal fractions expressing the ectodomains and
membrane-spanning domains of ER and FLS2 fused with GFP or a control,
uninoculated leaf sample. Immunoprecipitation was performed using anti-GFP
and immunoblotted using anti-GFP (for detection of receptors) or
anti-Stomagen antibodies. Experiments were repeated four times (four biological
replicates).
Extended Data Figure 7 | Purified mEPF2 and Stomagen recombinant
peptides and separation of bioactive mEPF2 by reverse-phase
chromatography. a, SDS-PAGE gel of purified and refolded mEPF2-MYC-HIS
and Stomagen recombinant peptides (asterisks). Left: molecular mass
markers. b, HPLC chromatogram of purified, refolded mEPF2. Peaks 1 and 2
in UV chromatogram were collected and subjected to bioassays. c, Confocal
image of cotyledon epidermis from wild-type seedling grown in a solution with
peak 1 for 5 days. No stoma is visible, indicating that peak 1 contains bioactive
mEPF2. Scale bar, 20 μm. n = 19. d, Confocal image of cotyledon epidermis
from wild-type seedling grown in a solution with peak 2 for 5 days, with normal
stomatal differentiation, indicating that the peptide is not bioactive. Scale
bar, 20 μm. n = 9.
Extended Data Figure 8 | Separation of properly folded, bioactive Stomagen
and mutant Stomagen peptides by reverse-phase chromatography followed
by mass spectrometry and bioassays. a, HPLC chromatogram of purified,
refolded Stomagen. Peaks 1 and 2 in UV chromatogram were collected and
subjected to MALDI-TOF mass spectrometry (b, d) as well as to bioassays
(c, e). b, MALDI-TOF spectrum of peak 1 from a. A single-charged peptide
corresponding to synthetic Stomagen peptide was observed at m/z = 5,118.5
([M+H]⁺) and a double-charged peptide at m/z = 2,559.8 ([M+2H]²⁺).
c, Confocal image of cotyledon epidermis from wild-type seedling grown in a
solution with peak 1. Severe stomatal clustering and overproduction of stomata
are observed. Scale bar, 20 μm. n = 8. d, MALDI-TOF spectrum of peak 2 from
a. e, Confocal image of cotyledon epidermis from wild-type seedling grown in a
solution with peak 2 from a, with no stomatal clustering, indicating that the
fraction is not bioactive. Scale bar, 20 μm. n = 6. f, HPLC chromatogram and
bioassays of an independent batch of Stomagen peptides used for QCM analysis
in direct comparison with non-folding mutant Stomagen peptides in Fig. 3c.
Peaks 1 and 2 in UV chromatogram were collected and subjected to bioassays.
Insets: confocal microscopy images of cotyledon epidermis from wild-type
seedling grown in a solution with peak 1 (bioactive) and peak 2 (non-active) for
5 days. Scale bars, 50 μm. n = 8 (peak 1); n = 6 (peak 2). g, HPLC
chromatogram of purified, mutant Stomagen peptide in which all cysteine
residues were substituted with serine residues (Stomagen_6C→S). The mutant
Stomagen peptide yielded a single peak, which was subjected to bioassays
followed by confocal microscopy (inset). No stomatal clustering was observed,
indicating that non-folding Stomagen peptide is not bioactive, confirming the
previous results’*. Scale bar, 50 μm. n = 8 for each peptide treatment.
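The two ion assignments quoted in b can be checked against each other with simple arithmetic: both should imply the same neutral peptide mass. The snippet below is only an illustrative sketch of that check. The proton mass is a standard physical constant rather than a value from this legend, and the function names are hypothetical.

```python
# Proton mass in daltons (standard constant; not taken from the legend).
PROTON_MASS = 1.00728


def neutral_mass(mz: float, charge: int) -> float:
    """Neutral mass M implied by an [M+zH]z+ ion observed at the given m/z."""
    return charge * (mz - PROTON_MASS)


def expected_mz(mass: float, charge: int) -> float:
    """m/z at which a peptide of neutral mass M should appear as [M+zH]z+."""
    return (mass + charge * PROTON_MASS) / charge


# Start from the singly charged peak reported in panel b.
mass = neutral_mass(5118.5, charge=1)
# Predict where the doubly charged ion of the same peptide should fall.
doubly_charged = expected_mz(mass, charge=2)
print(f"neutral mass ~ {mass:.1f} Da; predicted [M+2H]2+ at m/z {doubly_charged:.1f}")
```

Under these assumptions the predicted doubly charged peak falls at about m/z 2,559.8, in agreement with the value reported in b.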
Extended Data Figure 9 | Raw QCM recording data. Shown are raw
recording data of frequency shifts for representative QCM analysis using
biosensor chips immobilized with ER(ΔK)-GFP and GFP (a, b, inset) after
sequential injection of active Stomagen (a, c), mEPF2 (b), non-folding, inactive
mutant Stomagen (c, inset), or LURE2 (d) in increasing concentrations.
Bioactive Stomagen and inactive Stomagen experiments in c were performed
side by side. Arrows indicate time of additional peptide application.
Numbers of experiments performed for each analysis: Stomagen-ER, n = 3;
Stomagen-TMM, n = 2; Stomagen-GFP, n = 3; mEPF2-ER, n = 2;
mEPF2-TMM, n = 3; mEPF2-GFP, n = 2; inactive Stomagen_C6→S-ER,
n = 3; and LURE2-ER, n = 2.
Extended Data Table 1 | List of plasmids and primers used in this study

Plasmid ID | Description | Insert | Vector | BacR | Plant R
pKUT608 | STOMAGEN in pENTR | STOMAGEN cDNA | pENTR | KAN | NA
pKMP127 | proEst::STOMAGEN in pER8 | STOMAGEN cDNA | pER8 | SPEC/STREP | HYG
pJSL92 | ERL2 genomic ΔKinase in pENTR | ERL2 genomic ΔKinase | pENTR | KAN | NA
pJSL93 | 35S::gERL2-ΔKinase-GFP in pGWB5 | ERL2 genomic ΔKinase | pGWB5 | KAN/HYG | KAN/HYG
pJSL73 | FLS2ΔK in pENTR | FLS2ΔK cDNA no stop | pENTR | KAN | NA
pJSL75 | 35S:FLS2ΔK-GFP in pGWB5 | FLS2ΔK cDNA no stop | pGWB5 | KAN/HYG | KAN/HYG

Primer name | Sequence (5' to 3') | Purpose
EPFL9 1 XhoI f | CACCTCGAGATGAAGCATGAA | molecular cloning (pKUT608)
EPFL9 289 SpeI rc | ACTAGTTATCTATGACAAACAC | molecular cloning (pKUT608)
FLS2 1 (GW) F | CACCATGAAGTTACTCTCAAAGACCTTTTTG | molecular cloning (pJSL73)
FLS2 2625 rc | GATGTTGGCACTGTTGAATGAATCTGTTGC | molecular cloning (pJSL73)
FLS2 591 F | TGTAGCAGCTGGTAACCAT | Sequencing
eIF4A F | AGCCAGTGAGAATCTTGGTGAAGC | RT-PCR
eIF4A R | CTAGTACGGCAGAGCAAACACAGC | RT-PCR
STOMAGEN F | TGTAGTTCAAGCCTCAAGACCTC | RT-PCR
STOMAGEN R | ACTCGTTGTACGTACAAGTTGGT | RT-PCR
pER8 Term R | TCGAAACCGATGATACGGACG | RT-PCR
EPF1+207F | ATGCCGTCTTGTGATGGTTAG | RT-PCR
EPF1+315rc | TCAAGGGACAGGGTAGGACTT | RT-PCR
EPF2.1.cDNA.xhoI | CACCCTCGAGATGACGAAGTTTGTACGCAAGT | RT-PCR
EPF2.360.cDNA.ecoRI.rc2 | CGGAATTCTAGCTCTAGATGGCACGTGATAG | RT-PCR
HIF-driven SF3B1 induces KHK-C to
enforce fructolysis and heart disease 


Peter Mirtschink, Jaya Krishnan, Fiona Grimm, Alexandre Sarre, Manuel Hörl, Melis Kayikci, Niklaus Fankhauser,
Yann Christinat, Cédric Cortijo, Owen Feehan, Ana Vukolic, Samuel Sossalla, Sebastian N. Stehr, Jernej Ule,
Nicola Zamboni, Thierry Pedrazzini & Wilhelm Krek


Fructose is a major component of dietary sugar and its overconsumption exacerbates key pathological features of 
metabolic syndrome. The central fructose-metabolising enzyme is ketohexokinase (KHK), which exists in two isoforms: 
KHK-A and KHK-C, generated through mutually exclusive alternative splicing of KHK pre-mRNAs. KHK-C displays 
superior affinity for fructose compared with KHK-A and is produced primarily in the liver, thus restricting fructose 
metabolism almost exclusively to this organ. Here we show that myocardial hypoxia actuates fructose metabolism in 
human and mouse models of pathological cardiac hypertrophy through hypoxia-inducible factor 1α (HIF1α) activation
of SF3B1 and SF3B1-mediated splice switching of KHK-A to KHK-C. Heart-specific depletion of SF3B1 or genetic ablation of
Khk, but not Khk-A alone, in mice, suppresses pathological stress-induced fructose metabolism, growth and contractile
dysfunction, thus defining signalling components and molecular underpinnings of a fructose metabolism regulatory
system crucial for pathological growth.


Heart failure is a leading cause of death worldwide. It is often preceded 
by hypertrophic growth of the organ as it strains to maintain function 
under conditions of pathologic stress’ and is associated with an 
increased demand for oxygen, owing to changes in coronary resist- 
ance, cardiac vascularisation and perfusion’. The ensuing state of 
hypoxia activates hypoxia-inducible factor (HIF)1α that mediates
adaptive transcriptional responses to low-oxygen tension’ to support 
cardiomyocyte growth, myocardial angiogenesis and reprogramming 
of metabolism*. Unremitted pathologic stress leads ultimately to func- 
tional deterioration of the heart with the clinical signs and symptoms 
of heart failure’. 

Given that both HIF1α transcription and alternative pre-mRNA
splicing are central to the control of metabolism, we embarked on a
systematic and functional analysis of HIF1α-dependent splice factor
expression and pre-mRNA splicing in pathologic cardiac hypertrophy.
We identified splice factors as HIF1α targets, which prompted
the analysis of their RNA targets, unveiling mechanistic and functional
linkages between HIF1α, splice factor 3b subunit 1 (SF3B1),
KHK-C splice isoform production and fructose metabolism in cardiac
hypertrophy.


The HIF1α-SF3B1-KHK-C axis


To deduce changes in both gene expression and alternative splicing
as a function of Hif1α, we followed the experimental workflow
illustrated in Extended Data Fig. 1a. RNA isolated from normoxic
and hypoxic neonatal mouse cardiomyocytes (NMCs), depleted (or
not) of Hif1α by specific short hairpin RNAs (shHif1α), was subjected
to a splice-junction microarray analysis. Genes exhibiting a Hif1α-dependent
expression profile were manually checked for an involvement
in pre-mRNA splicing and, in parallel, matched to a list of
329 unique genes comprised of validated and in silico-predicted splice
factors* * (Supplementary Fig. 1 and Supplementary Table 1). Twenty
genes with characteristics of splice factors/regulators displayed a
Hif1α-dependent expression pattern (Fig. 1a), of which four, C1qbp,
Hnrnph3, Jmjd6 and Sf3b1, contained conserved hypoxia-responsive
elements (HREs) in their promoters (Extended Data Fig. 1b).

Data derived from the splice-junction microarray analysis were
also interrogated for changes in gene splicing as a function of Hif1α
and further filtered for their potential association to metabolism,
leading to Ard1 (also known as Naa10), Hk1, Khk, and Pfkfb3, which
exhibited the most consistent and significant changes (Fig. 1b). The
Hif1α dependency of Khk alternative splicing implied a link between
hypoxia and fructose metabolism, especially since the dominant
Khk isoform induced by hypoxia, Khk-C, has a superior activity over
Khk-A° (Supplementary Fig. 2).

Analysis of Khk alternative splicing in NMCs by real-time PCR
(RT-PCR) revealed Hif1α-mediated, Sf3b1-dependent induction of
Khk-C mRNAs, while mRNA levels of Khk-A declined (Fig. 1c and
Extended Data Fig. 1c-e). Isoproterenol treatment yielded similar
results (Extended Data Fig. 1f, g). Also, in mouse models of cardiac
hypertrophy, specifically one-kidney one-clip (1K1C), trans-aortic
constriction (TAC) and isoproterenol infusion, Khk-C mRNA and protein
production increased over sham-treated control mice (Fig. 1d, e and
Extended Data Fig. 1h-n). Hif1α target and hypertrophic marker gene
expression were dramatically increased in all three cardiomyopathy
models (Extended Data Fig. 1o and Supplementary Fig. 3a-c).

Analysis of HIF1α, SF3B1 and KHK-C in ventricular biopsies of
aortic stenosis or hypertrophic cardiomyopathy (HCM) patients corroborated
the results obtained in mice both at the level of mRNA and
protein expression (Fig. 1f-h and Extended Data Fig. 1p). mRNAs of
Figure 1 | Identification of Hif1α-dependent splicing factors and splicing
events. a, Expression of splice factors in NMCs transduced and cultured as
indicated versus NMCs treated with non-silencing shRNA (shNs) and cultured
at 21% O₂. b, Hit list of metabolic genes. c, d, NMCs transduced as indicated
(c) and ventricular samples from sham- and 1K1C-treated mice (d) were
analysed for Khk-A and Khk-C expression by RT-PCR and HincII digestion.
Primers to amplify the KHK variable region are indicated (c, upper). The
hypothetical HIF1α-SF3B1-KHK-C axis (c, right). e, Ventricular samples from
sham- and 1K1C-treated mice and lysates of NMCs overexpressing KHK-A or
KHK-C were processed for immunoblotting with antibodies against denoted
proteins. A and C indicate the differential migration of KHK-C and KHK-A on
SDS gels. f, Biopsies of left ventricles from aortic stenosis patients and healthy


HIF1α targets and hypertrophic markers were also induced (Extended
Data Fig. 1q and Supplementary Fig. 3d). Thus, HIF1α, SF3B1 and
KHK-C are coordinately upregulated in different mouse models of
cardiac hypertrophy and biopsies of human cardiomyopathy patients.


The HIF1α target SF3B1 mediates splicing of KHK


Sf3b1 appears to represent a direct Hif1α target. First, treatment of
NMCs with phenylephrine or isoproterenol, known inducers of Hif1α
and Hif1α-target genes’, increased Sf3b1 mRNA expression in a
Hif1α-dependent manner (Extended Data Fig. 2a). Second, Sf3b1
mRNA expression also increased upon depletion of the Von Hippel-Lindau
protein (pVhl) or ectopic expression of HIF1α lacking the
oxygen-dependent degradation domain (HIF1αΔODD) (Extended Data
Fig. 2b). Third, the promoter of SF3B1 contains a highly conserved
HRE (Extended Data Fig. 2c). Fourth, expression of HIF1αΔODD
activated a mouse wild-type Sf3b1-promoter-luciferase reporter but
not the corresponding reporter harbouring a mutant HRE (Fig. 2a).
Fifth, chromatin immunoprecipitation (ChIP) assays demonstrate an
association of Hif1α with the HRE in the Sf3b1 promoter in native
chromatin (Fig. 2b).

Next we measured the incorporation of radioactively labelled
[³H]leucine in NMCs depleted of Sf3b1 and exposed to various treatments
including phenylephrine or isoproterenol (Extended Data
Fig. 2d), expression of HIF1αΔODD (Extended Data Fig. 2e), hypoxia
(Extended Data Fig. 2f) or simultaneous depletion of pVhl (Extended
Data Fig. 2g). We observed that [³H]leucine incorporation (Extended
Data Fig. 2i-k), α/β myosin heavy chain mRNA ratio (Extended Data
Fig. 2l-o) and cell size (Extended Data Fig. 2p) changed as a function
of Sf3b1. Other components of the U2 small nuclear ribonucleoprotein
(U2 snRNP) complex, namely Sf3b2, Sf3b3 and Sf3a3, neither
increased in abundance in response to Hif1α expression nor
measurably affected Hif1α-stimulated cell growth when downregulated
by different shRNAs (Extended Data Fig. 2q, r and
Supplementary Fig. 4).

Also, known parameters of Hif1α-driven metabolism including
extracellular acidification rate (ECAR; an indicator of lactate production)
and oxygen consumption rate (OCR; an indicator of oxidative
phosphorylation) changed in a Sf3b1-dependent manner in NMCs.


controls were assessed for SF3B1, KHK-A and KHK-C expression by qPCR
(n = 16 for patient samples and n = 6 for controls; error bars are s.e.m.;
**P < 0.01; ***P < 0.001; two-tailed unpaired t-test). g, Top, schematic
representation of exon 2 to 5 of human KHK. Alternative usage of exon 3
(Ex3 alt.) is displayed in blue (exon 3A) and red (exon 3C). The location
and orientation of the primer sets used for RT-PCR analysis are indicated with arrows. Bottom,
assessment of KHK-A and KHK-C expression by RT-PCR in ventricles of
controls and HCM patients using primer set 1 and 2 as described above.
h, Lysates of left-ventricular samples from healthy controls and aortic stenosis
or HCM patients were processed for immunoblotting with antibodies
against denoted proteins.


At basal levels, ECAR was not affected by depletion of Sf3b1 but 
increased upon addition of isoproterenol (Extended Data Fig. 2s, t). 
OCR was repressed, as expected, in isoproterenol-treated cells upon 
addition of either oleic acid or carbonyl cyanide-4-(trifluoromethoxy)- 
phenylhydrazone, an effect diminished by depletion of Sf3b1 (Extended 
Data Fig. 2u). Ectopic expression of HIF1αΔODD produced similar
effects on ECAR and OCR as isoproterenol (Extended Data Fig. 2v-z),
while depletion of Sf3b1 alone had no effect (Supplementary Fig. 5). In
sum, these results suggest a key contributory role for Sf3b1 in Hif1α-mediated
growth and metabolic programs.

Sf3b1 participates in the catalysis of splicing'', can be crosslinked 
to the 5’ and 3’ of the branch point adenosine”, and displays differ- 
ential effects on specific splice targets under conditions of reduced 
function’*"*. Indeed, splice-junction analysis revealed widespread 
effects on both alternative splicing and gene expression that included 
changes in Khk isoform expression when Sf3b1 was depleted in 
hypoxic NMCs (Supplementary Tables 2 and 3). RNA immunoprecipitations
of Sf3b1 from extracts of NMCs exposed to hypoxia or
HIF1αΔODD overexpression demonstrate that Sf3b1 occupies
RNA sequences upstream of exon 3C (and not of exon 3A) of Khk
pre-mRNA in hypoxia but not normoxia (Fig. 2c). Upon depletion of
Hif1α, Sf3b1 was redirected towards branch-point sequences of exon
3A (Fig. 2c). This suggests that Sf3b1 association with RNA sequences
upstream of exon 3C of Khk requires Hif1α signalling.

Sf3b1 acts as a direct mediator of Khk splicing as its overexpression 
caused a shift from Khk-A to Khk-C (Extended Data Fig. 3a). Selective 
depletion of endogenously (but not exogenously) produced Sf3b1 
prevented this shift (Extended Data Fig. 3a). Quantitative PCR 
(qPCR) analysis confirmed the overexpression and/or reduction of 
Sf3b1 mRNAs where expected (Extended Data Fig. 3b). Concomitant 
with the shift in Khk isoform expression towards Khk-C, cell growth 
was induced (Extended Data Fig. 3c). Neither the overproduction of 
SF3A3 or SF3B3 (Extended Data Fig. 3d, e), nor the depletion of Sf3a3, 
Sf3b3 or Sf3b2 by corresponding shRNAs, generated a similar robust 
splice shift towards Khk-C despite efficient depletion of the respective 
proteins (Extended Data Fig. 3f, g and Supplementary Fig. 4). Cell 
growth was also not affected by SF3A3 or SF3B3 expression (Extended 
Data Fig. 3h). 


25 JUNE 2015 | VOL 522 | NATURE | 445 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


[Figure 2 graphic, panels a-g. Recoverable labels from panel d: glucose-6-phosphate, fructose-6-phosphate, fructose-1,6-bisphosphate, fructose-1-phosphate, glucose-1-phosphate, galactose-6-phosphate and galactose-1-phosphate, shown as log2 values.]


Figure 2 | SF3B1 is a HIF1α-target gene that mediates splicing of KHK.
a, Co-transfection of wild type (WT) or HRE-mutated SF3B1 promoters fused
to luciferase with either an empty vector control or HIF1αΔODD (values
are relative light units (r.l.u.); n = 4 biological replicates per group). b, NMCs
were cultured as indicated and processed for chromatin immunoprecipitation
with a HIF1α-specific antibody (IP: HIF1α) or with a control isotype-matched
antibody (IgG control). c, Schematic representing branch-point
sequences located in front of exon 3A (upper left) or exon 3C (lower left).
Branch point adenine is labelled in red. Location and orientation of primers
(arrows), expected length of fragments to be amplified by RT-PCR and
potential Sf3b1 binding sites are depicted. NMCs treated as indicated were
processed for RNA-immunoprecipitation assay with antibodies specific for
Sf3b1 (IP) or control isotype-matched antibody (IgG). Recovered RNA was
purified and analysed by RT-PCR (upper right and lower right). d, Mass
spectrometry-based metabolomic profiling of NMCs transduced with
lentiviruses as denoted, focused on hexose-phosphates. Metabolite values are
compared to amounts in control transduced NMCs (n = 4 biological replicates
per group). e, KHK activity in lysates of NMCs transduced with lentiviruses as
indicated (n = 7 biological replicates per group). f, Relative amount of
[¹³C]fructose-1-phosphate derived from [¹³C]fructose ([¹³C]frc) or
[¹³C]glucose ([¹³C]glc) in NMCs transduced as denoted (n = 4 biological
replicates per group). g, [³H]leucine incorporation in NMCs infected as
indicated (n = 3 biological replicates per group, data show 1 of 3 representative
experiments). For panels a, e-g, *P < 0.05; **P < 0.01; ***P < 0.001.
Two-tailed unpaired t-test (a) and one-way ANOVA followed by Dunnett’s multiple
comparison post-test (e-g). Error bars are s.d. (a, g) or s.e.m. (e, f).


Next we proceeded with mass-spectrometry-based metabolomic
profiling of hexose-phosphates in NMCs expressing HIF1αΔODD
in the absence or presence of shSf3b1. We observed profoundly
increased levels of fructose-1-phosphate (F1P) and glucose-6-phosphate
in response to HIF1α activation (Fig. 2d), but only the
level of the former was Sf3b1-dependent (Fig. 2d). In accord with
increased levels of F1P, HIF1αΔODD also increased total cellular
Khk activity by 3.6-fold compared to control NMCs (Fig. 2e). In a
comparative analysis, overexpressed human KHK-C displayed, in
NMCs depleted for endogenous Khk, a 3.7-fold higher kinase activity
than KHK-A (Extended Data Fig. 3i, j). Thus, activation of the Hif1α-Sf3b1
axis activates Khk-C to stimulate F1P production.

To assess whether increased levels of F1P are derived from external
fructose or from conversion of glucose via the polyol pathway’*, we
supplied NMC culture media with [¹³C]fructose and unlabelled glucose
or [¹³C]glucose and unlabelled fructose and measured [¹³C]F1P
production. [¹³C]F1P levels increased upon HIF1αΔODD expression
selectively in NMCs cultured with [¹³C]fructose and unlabelled glucose
but not [¹³C]glucose and unlabelled fructose, in a manner dependent on
Sf3b1 (Fig. 2f). Moreover, KHK-C, but not KHK-A, expression induced
accumulation of [¹³C]F1P in a time-dependent fashion in NMCs
cultured with [¹³C]fructose (Extended Data Fig. 3k). Glut5 fructose
transporter mRNA expression was likewise induced by KHK-C and
HIF1αΔODD (Extended Data Fig. 3l).

Khk-C is also a critical downstream mediator of Hif1α-induced cell
growth as selective depletion of Khk-C, but not Khk-A, rescued
HIF1αΔODD-induced cardiomyocyte growth (Fig. 2g and Extended
Data Fig. 3m). Knockdown efficiency and isoform specificity of the 
corresponding shRNAs targeting Khk isoforms was confirmed by 
qPCR (Extended Data Fig. 3n). Taken together, Sf3b1 appears to be 
a direct mediator of Khk pre-mRNA splicing, resulting in Khk-C 
expression, increased Khk activity, fructose uptake and cell growth. 


Expression of KHK-C promotes anabolic metabolism 


To characterize metabolic phenotypes in NMCs associated with
activation of the HIF1α-SF3B1-KHK-C axis and as a function of
fructose availability, we performed metabolomic profiling. These
efforts established the following experimental points. First, in the
presence of fructose and glucose, HIF1αΔODD expression increased
glycolysis (for example, hexose-phosphates), amino acid metabolism
and biosynthesis (for example, dexfosfoserine, aspartate, arginine),
the non-oxidative pentose-phosphate pathway (PPP; for example,
sedoheptulose-7-phosphate, pentose phosphates) and to a lesser
degree also the oxidative PPP (for example, 6-phospho-D-gluconate,
pentose phosphates) and intermediates of DNA/RNA biosynthesis 
(for example, N-formyl-GAR, folate, deoxyuridine) (Fig. 3). Changes 
in redox balance are exemplified by lower levels of NADPH, NADH 
and of the reduced state of glutathione (GSSG) (Fig. 3, lanes 4 and 5). 
Reduced levels of these metabolites are probably the result of extensive 
utilization of reducing equivalents for macromolecular biosynthesis’®. 
Second, depletion of Khk-C (but not Khk-A) reverted HIF1αΔODD-induced
metabolic changes (Fig. 3, compare lane 11 with 5 and 4),
while its overexpression mimicked them (Fig. 3, compare lanes 1-3
and 4-5). Third, when KHK-C was overexpressed under fructose-lacking
conditions, levels of glycolytic and PPP metabolites decreased
and the reduction of NADPH and NADH was less pronounced (Fig. 3, 
compare lane 2 and 3; Extended Data Fig. 4a). Hence, exogenous 
fructose contributes significantly to the metabolic effects mediated 
by Khk-C. 

In accord with the above, we detected [³H] radiolabel derived from
[³H]fructose in the RNA, DNA and protein fractions of NMCs in
a HIF1αΔODD- and KHK-C-dependent manner (Extended Data
Fig. 4b). Expression of KHK-C also stimulated glucose uptake in the
presence of fructose (Extended Data Fig. 4c). Moreover, [³H]leucine
incorporation into NMCs increased as a function of fructose availability
in a KHK-C-dependent fashion (Extended Data Fig. 4d). Increasing
glucose availability in the presence of fructose had similar effects
(Extended Data Fig. 4d). Immunoblotting confirmed the expected
expression pattern of KHK-A and KHK-C (Supplementary Fig. 6a).
These results suggest a critical role for KHK-C in HIF1α-dependent
metabolic reprogramming and growth.

Since phosphorylation of fructose by Khk-C is unrestrained, it 
could contribute to ATP-depletion as shown previously in liver’’, 
Figure 3 | Anabolic metabolism and growth as a function of KHK-C.
Untargeted analysis of metabolites in NMCs treated as denoted. Depicted are
metabolites with log2(fold change) > 0.5 and adjusted P value < 0.01 in at least
one treatment group compared to corresponding control (n = 4 biological
replicates per group).
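The selection rule stated in this legend (a metabolite is shown if it passes both thresholds in at least one treatment group) can be sketched as a small filter. This is an illustration only, not the authors' analysis pipeline: the function name, the data layout and the example values are hypothetical, and taking the absolute value of the log2 fold change is an assumption made here so that decreases are retained as well as increases.

```python
from typing import Dict, List, Tuple

# stats[metabolite][treatment_group] = (log2 fold change vs control, adjusted P value)
Stats = Dict[str, Dict[str, Tuple[float, float]]]


def select_metabolites(stats: Stats,
                       lfc_cutoff: float = 0.5,
                       p_cutoff: float = 0.01) -> List[str]:
    """Keep metabolites passing both cutoffs in at least one treatment group.

    Assumption: the fold-change cutoff is applied to the absolute value.
    """
    selected = []
    for metabolite, groups in stats.items():
        if any(abs(lfc) > lfc_cutoff and p_adj < p_cutoff
               for lfc, p_adj in groups.values()):
            selected.append(metabolite)
    return selected


# Hypothetical numbers, made up purely to exercise the filter.
example: Stats = {
    "metabolite_1": {"group_A": (1.8, 0.001), "group_B": (0.2, 0.40)},
    "metabolite_2": {"group_A": (-0.9, 0.004), "group_B": (-0.1, 0.80)},
    "metabolite_3": {"group_A": (0.3, 0.001), "group_B": (0.7, 0.20)},
}
kept = select_metabolites(example)
print(kept)
```

In this made-up example the third metabolite is dropped because no single group satisfies both thresholds at once, even though each threshold is met somewhere.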


thereby limiting ATP-driven allosteric inactivation of glycolysis at the 
level of phosphofructokinase (PFK)'*"°. When HIF1αΔODD was
overexpressed in NMCs, the ADP/ATP ratios increased (Extended
Data Fig. 4e). Simultaneous depletion of either Sf3b1 or Khk-C (but
not Khk-A) prevented this (Extended Data Fig. 4e). Khk-C but not
Khk-A, when overexpressed, enhanced ADP/ATP ratios (Extended
Data Fig. 4e). Thus, activation of the HIF1α-SF3B1-KHK-C axis
increases ADP/ATP ratios.

Hypoxia or activation of HIF1α drives an accumulation of lipids in
heart hypertrophy’°”®. Since the F1P product DHAP is important for
formation of the glycerol backbone in triglyceride synthesis, we tested
the Khk-C-dependency of this phenomenon by Oil Red O staining
of lipid droplets in oleic-acid-treated NMCs. Overexpression of
HIF1αΔODD or exposure to hypoxia induced lipid droplet formation,
which was prevented upon depletion of Khk-C but not Khk-A
(Extended Data Fig. 4f-h). Similar results were obtained when
lipid levels were quantified with Nile Red (Supplementary Fig. 6b,
c). Khk-C, but not Khk-A, was also required for phenylephrine-induced
lipid accumulation (Supplementary Fig. 6d). Fructose carbon
contributes to triglyceride synthesis in NMCs as measured by the
amount of fructose-derived [³H] in the lipid fraction of NMCs expressing
HIF1αΔODD (Supplementary Fig. 6e). Downregulation of
Khk-A had no effect, but depletion of either Khk-C, Glut5 or Sf3b1
prevented HIF1α-driven conversion of fructose carbon into lipids
(Supplementary Fig. 6e, f). In a gain-of-function experiment, overexpression
of KHK-C promoted fructose carbon to lipid conversion
while KHK-A expression did not (Supplementary Fig. 6e).

We also investigated the contribution of the two different Khk
isoforms to Hif1α-driven changes in OCR and ECAR in NMCs
(Extended Data Fig. 4i-l). Khk-A depletion affected neither the
ECAR nor the OCR of HIF1αΔODD-expressing cells (Extended
Data Fig. 4i), while depletion of Khk-C did (Extended Data Fig. 4j).
In the absence of HIF1αΔODD, neither Khk-A nor Khk-C depletion
alone demonstrated any measurable effects on ECAR or OCR
(Supplementary Fig. 6g, h). In a converse experiment, overexpression
of KHK-A alone had little, if any, effect on these parameters while
KHK-C overexpression increased ECAR and decreased OCR
(Extended Data Fig. 4k, l). Thus, by stimulating fructose metabolism
in cardiomyocytes, Khk-C expression promotes conversion of fructose
carbon to lipids, suppresses mitochondrial oxidative phosphorylation
and increases glycolysis.


Cardiac growth and dysfunction depends on Sf3b1 


Next we assessed whether myocardial fructose and glucose uptake is
induced in an in vivo model of cardiac hypertrophy (Extended Data
Fig. 5a). When TAC-treated and sham-operated control mice were
simultaneously exposed to [³H]fructose and [¹⁴C]deoxyglucose,
both tracers accumulated to higher levels in diseased hearts (Extended Data
Fig. 5b, c). Hearts of mice after 1K1C surgery (Extended Data Fig. 5d)
demonstrated a 2.5-fold increase of fructose levels (Extended Data
Fig. 5e). [¹⁴C]fructose or [¹⁴C](deoxy)glucose uptake was also stimulated
under conditions of hypertrophy in ex vivo ventricular tissue
(Extended Data Fig. 5f, g). In line with this, Glut5 expression was
induced in these hypertrophy models (Extended Data Fig. 5h, i), in
hearts of isoproterenol-treated mice and in left-ventricular biopsies of
aortic stenosis and HCM patients (Extended Data Fig. 5j-l). In contrast,
expression of aldose reductase (AKR1B1), which converts glucose
to sorbitol in the polyol pathway, was not induced in the latter
samples (Extended Data Fig. 5k, l). Thus, cardiac hypertrophy is
associated with increased fructose levels and uptake.

To study the Sf3b1-dependency of these phenomena in response to 
pathologic stress, we designed a viral-based vector system enabling 
ventricular-specific knockdown of Sf3b1 that is based on an AAV9 
backbone containing a U6 promoter followed by a TATA-lox site, a 
CMV promoter driving eGFP, another TATA-lox site and shRNAs 
targeting Sf3b1 (Fig. 4a). We constructed two viruses, each expressing 
a different shRNA targeting Sf3b1 (AAV9-fl/fl-shSf3b1#1 and 
AAV9-fl/fl-shSf3b1#2). Validation of these recombinant viruses 
was performed both in vitro in NMCs and in vivo in Mlc2v-cre− 
and Mlc2v-cre+ mice (Extended Data Fig. 5m, n and Supplementary 
Fig. 7a–d). 

While injection of AAV9-fl/fl-shSf3b1#1 into Mlc2v-cre+ or Mlc2v-cre− 
mice failed to cause any effects on ventricular size in sham-operated 
mice, AAV9-fl/fl-shSf3b1#1 injection protected Mlc2v-cre+ but 
not Mlc2v-cre− littermates from 1K1C-induced cardiac hypertrophy 
(Fig. 4b) and diastolic dysfunction (Fig. 4c and Extended Data Table 1). 

Figure 4 | SF3B1 is required for pathologic cardiac growth and function. 
a, Schematic representation of the AAV9-fl/fl-shSf3b1 virus before and after 
Cre-mediated recombination (left panel) and of the experimental timeline 
(right panel). b–d, Left-ventricular weight normalized to tibia length 
(LV/TL) (b), isovolumic relaxation time (IVRT) (c) and ventricular fructose 
levels (d) of sham- or 1K1C-treated Mlc2v-cre− and Mlc2v-cre+ mice injected 
with AAV9-fl/fl-shSf3b1#1 viruses. e, Immunoblots of heart lysates from 
Mlc2v-cre− and Mlc2v-cre+ mice operated and transduced as in b–d using 


Like hypertrophic growth and function, the increase in fructose 
levels was also dependent on Sf3b1 (Fig. 4d). Immunoblotting of lysates 
from these sham- or 1K1C-operated Mlc2v-cre− and Mlc2v-cre+ mice 
confirmed that Sf3b1 was depleted where expected, preventing the 
generation of the Khk-C isoform in ventricles of 1K1C-treated Mlc2v-cre+ 
mice (Fig. 4e). 

We also analysed the role of Sf3b1 in the TAC model. Application 
of both AAV9-fl/fl-shSf3b1#1 and AAV9-fl/fl-shSf3b1#2 viruses protected 
Mlc2v-cre+ but not Mlc2v-cre− littermates from cardiac hypertrophy 
(Fig. 4f; Extended Data Fig. 5o and Extended Data Table 1) and 
systolic dysfunction (Fig. 4g; Extended Data Fig. 5p and Extended 
Data Table 1). Immunoblotting confirmed that Sf3b1 was downregu- 
lated and prevented the switch to the Khk-C isoform in response to 
TAC-induced pathologic stress in hearts of Mlc2v-cre* mice (Fig. 4h 
and Extended Data Fig. 5q). These results suggest that Sf3b1 is 
required for the development of pathologic cardiac hypertrophy. 


Deletion of Khk protects from cardiac growth 

To assess the importance of Khk-C in cardiac hypertrophy and func- 
tion in the context of pathologic stress, we investigated two genetic 
mouse models, which were subjected to TAC and 1K1C stresses. 
Khk-A−/− mice only express Khk-C, whereas Khk-A/C−/− mice 
lack both isoforms. Exposure of wild-type and Khk-A−/− 
mice to TAC treatment induced cardiac hypertrophy and systolic 
dysfunction in both mouse strains (Extended Data Fig. 6a–e and 
Extended Data Table 2). ATP measurements in left ventricles of 
sham- or TAC-treated wild-type and Khk-A−/− mice revealed a 
reduction of ATP levels in hearts after TAC compared to control mice 
(Extended Data Fig. 6f). 

Hearts of Khk-A/C−/− mice, when exposed to TAC, did not display 
signs of hypertrophy or systolic dysfunction (Extended Data Fig. 6g–k 
and Extended Data Table 2). Moreover, ATP levels remained 
unchanged in Khk-A/C−/− mice exposed to TAC (Extended Data 
Fig. 6l). All TAC-treated hearts accumulated Hif1α and Sf3b1 in 
the left ventricle (Extended Data Fig. 6m, n). Khk-C expression was 
only detectable in TAC-treated hypertrophied ventricles, but not in 
the corresponding hearts from Khk-A/C−/− mice (Extended Data 
Fig. 6m, n). 

indicated antibodies. f, g, LV/TL (f) and ejection fraction (EF) (g) of sham- 
or TAC-treated Mlc2v-cre− and Mlc2v-cre+ mice injected as indicated. 
h, Lysates of hearts from mice transduced and operated as in f were 
processed for immunoblotting with antibodies against denoted proteins. For 
a–d, f, g, number of mice per group is given in Extended Data Table 1. 
Error bars are s.e.m. *P < 0.05; **P < 0.01; two-tailed unpaired t-test 
(b–d, f, g). 


1K1C treatment of Khk-A−/− and Khk-A+/+ mice caused significant 
left-ventricular growth (Fig. 5a, b; Extended Data Fig. 6o–q and 
Extended Data Table 3) and diastolic dysfunction (Fig. 5c and 
Extended Data Table 3). Hif1α and Sf3b1 levels were induced in left 
ventricles of both 1K1C-treated strains, whereas Khk-A expression in 
sham-treated control mice was depleted as expected (Fig. 5d). 
Additional deletion of Khk-C protected hearts from these pathologies 
(Fig. 5e–g; Extended Data Fig. 6r–t and Extended Data Table 3). 
Immunoblotting confirmed activation of Hif1α and Sf3b1 upon 
pathologic stress, and that Khk-C was only expressed in wild-type 
and Khk-A−/− but not in Khk-A/C−/− ventricles (Fig. 5h). Arterial 

Figure 5 | Khk-A/C−/− mice are protected from pathologic cardiac growth 
and contractile dysfunction. a–c, Left-ventricular weight normalized to 
tibia length (LV/TL) (a), representative images of left ventricles (b) and 
isovolumic relaxation time (IVRT) (c) of sham- or 1K1C-operated Khk-A+/+ 
and Khk-A−/− mice. d, Lysates from ventricles of sham- or 1K1C-operated 
Khk-A+/+ and Khk-A−/− mice were processed for immunoblotting with 
antibodies as indicated. e–g, Left-ventricular weight normalized to tibia length 
(e), representative images of left ventricles (f) and isovolumic relaxation 
time (g) of sham- or 1K1C-operated Khk-A/C+/+ and Khk-A/C−/− mice. 
h, Lysates from ventricles of sham- or 1K1C-operated Khk-A/C+/+ and 
Khk-A/C−/− mice processed for immunoblotting with antibodies as indicated. 
For a, c, e, g, *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired 
t-test; number of mice per group is given in Extended Data Table 2. Error 
bars are s.e.m. 

blood pressure was similar in these mouse strains following 1K1C 
treatment (Extended Data Fig. 6u, v). In sum, only deletion of Khk-A 
in combination with Khk-C, but not Khk-A alone, prevents the 
development of pathologic growth and cardiac dysfunction upon 
different stresses. 


Discussion 


Our results demonstrate that fructose metabolism can be provoked in 
the heart by pathologic stimuli through HIF1α activation of SF3B1-mediated 
splicing of KHK pre-mRNA (Extended Data Fig. 7). The 
potential general importance of these findings draws on the central role 
occupied by HIF1α in human disease, highlighting the 
HIF1α–SF3B1–KHK-C axis as a promising therapeutic target. 

Splicing output of the HIF1α target SF3B1, which contributes to 
intron removal by anchoring pre-mRNA onto the spliceosome, is 
sensitive to changes in the abundance of SF3B1. This capacity of 
SF3B1 to expand and/or change its splice target spectrum as a function of 
abundance may be exploited by HIF1α to affect KHK-C production. 
That the latter is less sensitive to changes in the levels of other factors 
of the U2 snRNP complex investigated here implies that SF3B1 is 
either limiting for U2 snRNP assembly/function or occupies a specific 
role in this context. Irrespective, this process is likely influenced by 
additional positive- and/or negative-acting factors, as has been 
demonstrated for PKM2. 

The criticality of KHK-C function in pathologic cardiac growth 
may manifest at multiple levels. Unrestrained KHK-C-driven phos- 
phorylation of fructose may limit ATP levels and thus restrict 
ATP-mediated negative feedback inhibition of PFK, facilitating high 
glycolytic flux. The latter may be further enhanced by an increased 
availability of F1P, as this metabolite has been shown to induce glucokinase 
and pyruvate kinase, at least in the liver. Apart from positively 
affecting glucose metabolism, KHK-C and fructose availability 
also increased nucleic acid, lipid and protein biosynthesis, which may 
support growth. Although our work points to a fructose-metabolism-related 
function of KHK-C in cardiac hypertrophy, the source of 
fructose could, under certain circumstances, vary and originate alternatively 
from glucose. In addition, yet-to-be-discovered functions of 
KHK-C unrelated to fructose metabolism (that may, for example, 
relate to gene regulatory activities) could contribute to cell growth 
control. 

Biopsies of heart disease patients demonstrate increased expression 
of HIF1α, SF3B1 and KHK-C. Since aortic stenosis and diabetes are 
frequent comorbidities in senescent humans and diabetic patients 
also tend to have higher fructose levels in serum, the impact of the 
HIF1α–SF3B1–KHK-C axis in these patients might be particularly 
important. With respect to other disease states, SF3B1 is found recurrently 
mutated in human cancers. It is conceivable that some of these 
cancer-associated mutations affect alternative splicing of KHK and 
fructose metabolism in cancer cells independent of hypoxia, thereby 
contributing to tumour development. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Human ventricular biopsies. Left-ventricular tissues were obtained from 
patients with hypertrophic cardiomyopathy and aortic stenosis (Supplementary 
Table 4) and healthy controls. Additional left-ventricular biopsies were provided 
by W. Linke (University of Münster). Clinical and demographic data from the 
corresponding patients were previously published. All procedures were in 
compliance with local ethical committees. Informed consent was obtained from all 
patients included in the study. 

Mouse strains. Khk-A−/− and Khk-A/C−/− mice were obtained from A. Asipu 
(University of Leeds) and R. Johnson (University of Colorado). The MLC2v-cre line 
was from J. Chen (University of California). All mice were maintained in a 
conventional animal facility at RCHCI and EPIC, ETH Zürich. Maintenance 
and animal experimentation were in accordance with the Swiss Federal 
Veterinary Office (BVET) guidelines. Data presented in this manuscript represent 
studies with male mice. After baseline echocardiography, mice were randomly 
distributed to sham or treatment groups. Animal surgeries and echocardiography 
were performed blinded; that is, the involved veterinarian was not aware of the 
genotype of the mice before or during operation. Animal numbers for experiments 
were chosen based on expected mortality rates and phenotypical or functional 
changes of hearts in wild-type mice due to treatment/surgery. Animals 
were excluded from the study in case of death before the experimental endpoint or 
based on the evaluation of pain using a standardized score sheet, approved by 
BVET. 

One-kidney, one-clip. Six- to seven-week-old Khk-A−/−, Khk-A/C−/− and 
corresponding Khk-A+/+ and Khk-A/C+/+ male mice with a pure C57/BL6J 
background as well as Mlc2v-cre+ and Mlc2v-cre− male mice with a C57/BL6J 129S 
mixed background were anaesthetized by inhalation of isoflurane (4% in oxygen, 
1 l min⁻¹) and surgery was performed as previously described. A sham procedure, 
which included the entire surgery with the exception of artery clipping, 
was applied to control mice. 

Trans-aortic constriction. 10–12-week-old Khk-A−/−, Khk-A/C−/− and corresponding 
Khk-A+/+ and Khk-A/C+/+ male mice as well as Mlc2v-cre+ and Mlc2v-cre− 
male mice were subjected to pressure overload by trans-aortic constriction 
(TAC) through constriction of the descending aorta as described. The mice were 
monitored for 14–18 days after surgery, and their heart functions were determined by 
echocardiography. 

In vivo isoproterenol delivery. Alzet model 2002 mini-osmotic pumps (Alzet) 
were filled with isoproterenol, and were set to deliver 30 mg kg⁻¹ per day for 14 
days in C57/BL6J wild-type mice. Pumps were then implanted into 9–10-week-old 
male mice under isoflurane anaesthesia as previously described. 

Echocardiography. Transthoracic echocardiography was performed using a 30 
MHz probe and the Vevo 770 Ultrasound machine (Visualsonics) under mild 
anaesthesia with 1.5% isoflurane as previously described. 

Direct arterial blood pressure measurements in mice. The arterial blood pressure 
measurements in mice were performed as described previously. Blood 
pressure was monitored for about 1 h in awake animals that were unconstrained. 

Isolation and maintenance of primary neonatal mouse cardiomyocytes. 
Isolation of neonatal mouse cardiomyocytes was performed as previously 
described. Experiments involving HIF1α, SF3B1 and KHK-C were performed 
with maintenance medium supplemented with physiologic concentrations of 
glucose (5 mM) and fructose (25 µM). The medium was changed daily. 
Pharmacological treatment was done with phenylephrine at a concentration 
of 100 µM and with isoproterenol at 10 µM for 48 h before preparation of SDS 
samples or processing for RNA isolation. 

Lentivirus production and transduction. Lentiviral particles were produced 
in HEK-293T cells as previously described. HEK-293T cells were purchased 
from American Type Culture Collection (ATCC) and regularly checked for 
mycoplasma contamination using a PCR-based mycoplasma detection kit 
(Sciencell). Cells were transduced with lentivirus 3 h after isolation by adding 
virus-containing supernatants to the cell culture medium and incubating cells at 
37 °C/5% CO2 overnight. 

Transient transfection. The pcDNA3 Sf3b1 construct was transiently transfected 
into 2-day-old NMCs using Lipofectamine 2000 (Invitrogen) according 
to the manufacturer’s protocol. Cells were harvested and analysed 36 h after 
transfection. 

AAV9-fl/fl-shSf3b1. For the generation of the viral constructs, we cloned two 
shRNAs targeting Sf3b1 in a pSico vector where the U6 promoter-driven shRNA 
expression is Cre-lox regulated. The shRNA sequences were the following: 
AAV9-fl/fl-shSf3b1#1, sense (5'–3'): 
TGCATCGTCCTCCAAAGATTTTCAAGAGAAATCTTTGGAGGACGATGCTTTTTTC, 
antisense (5'–3'): 
TCGAGAAAAAAGCATCGTCCTCCAAAGATTTCTCTTGAAAATCTTTGGAGGACGATGCA; 
AAV9-fl/fl-shSf3b1#2, sense (5'–3'): 
TGCTGCCTTACGTCAGATTATTCAAGAGATAATCTGACGTAAGGCAGCTTTTTTC, 
antisense (5'–3'): 
TCGAGAAAAAAGCTGCCTTACGTCAGATTATCTCTTGAATAATCTGACGTAAGGCAGCA. 
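
As an illustrative consistency check (not part of the original protocol), the following Python sketch tests whether each antisense oligonucleotide listed above is the reverse complement of its sense partner once a 4-nt 5' overhang is set aside. The overhang `TCGA` and the helper names `reverse_complement` and `anneals` are assumptions inferred from the sequences themselves, not details stated in the text.

```python
# Hypothetical helper: checks that the sense/antisense oligo pairs would
# anneal into a duplex with a 4-nt 5' overhang on the antisense strand.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def anneals(sense: str, antisense: str, overhang: str = "TCGA") -> bool:
    """True if antisense == overhang + reverse complement of sense."""
    return (antisense.startswith(overhang)
            and antisense[len(overhang):] == reverse_complement(sense))


# Oligo sequences copied from the Methods text (5' to 3').
OLIGOS = {
    "shSf3b1#1": (
        "TGCATCGTCCTCCAAAGATTTTCAAGAGAAATCTTTGGAGGACGATGCTTTTTTC",
        "TCGAGAAAAAAGCATCGTCCTCCAAAGATTTCTCTTGAAAATCTTTGGAGGACGATGCA",
    ),
    "shSf3b1#2": (
        "TGCTGCCTTACGTCAGATTATTCAAGAGATAATCTGACGTAAGGCAGCTTTTTTC",
        "TCGAGAAAAAAGCTGCCTTACGTCAGATTATCTCTTGAATAATCTGACGTAAGGCAGCA",
    ),
}

if __name__ == "__main__":
    for name, (sense, antisense) in OLIGOS.items():
        print(name, "pair is consistent:", anneals(sense, antisense))
```

A check of this kind is mainly useful for catching transcription errors when oligonucleotide sequences are copied between documents.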

After in vitro validation of shRNA efficiency using AdenoCre expression to 
induce Cre-mediated recombination and activation of shSf3b1 expression, the 
region bearing the U6 promoter, the shRNA and the TATA-lox sites of the pSico 
construct was amplified and ligated into NheI/XhoI-linearized AAV-bGH(+) 
vector. Viral amplification was done by Vectorbiolabs. AAV9 viruses were 
injected at a concentration of 1.2 X 10'* genome copies (GC) per kg body weight 
into tail veins of Mlc2v-cre+ or Mlc2v-cre− mice. 

Plasmid constructions. The pLenti-pgk-HIF1αΔODD puro lentiviral expression 
vector was generated by subcloning the HIF1αΔODD fragment from a 
pcDNA3-HA-HIF1αΔODD(401-603) plasmid as described previously. The KHK-A 
and KHK-C overexpression constructs were generated by subcloning KHK-A 
and KHK-C from the pET11a-KHK-A and pET11a-KHK-C expression constructs 
(provided by A. Asipu) into the Gateway Entry vector pENTR1a and by 
LR reaction into the pLenti-pgk-puro lentiviral expression vector. As a corresponding 
control, the pLenti-pgk puro empty vector was used. SF3A3 and SF3B3 
overexpression constructs were generated by subcloning SF3A3 (from human 
ORFeome 8.1 collection, Thermo Scientific) or SF3B3 (from Addgene, plasmid 
#11241) into the pLKO.1 CMV puro construct provided by A. Ittner (ETH Zürich), 
using EcoRI and SalI (SF3A3) or EcoRI and MluI (SF3B3) restriction sites. As a 
control vector, the empty pLKO.1 CMV puro construct was used. Since the wild-type 
coding sequence of Sf3b1 is toxic to bacteria, we used a codon-optimized 
Sf3b1 to perform overexpression experiments (the DNA and amino acid 
sequences can be found in Supplementary Data 1). First, we amplified Sf3b1 from 
mouse cDNA up to the internal XhoI restriction site (1–1,568 bp) and subcloned 
it into a pcDNA3 vector using EcoRI/XhoI restriction sites. The synthesis and 
codon optimisation of the second part (1,578–3,915 bp) was performed by 
Genscript. Finally, we subcloned the two parts of Sf3b1 by double ligation into a 
pcDNA3 vector using EcoRI, XhoI and XbaI restriction sites. 

Sf3b1 promoter-luciferase construct. 1.5 kilobases (kb) of the Sf3b1 promoter 
was amplified from mouse BAC genomic DNA and cloned into the pGL3 luciferase 
reporter vector (Stratagene). HRE mutants of the respective promoters 
were generated by recombinant PCR. Sense and antisense primers bearing HRE 
mutations were generated and used to amplify the mutant 5' and 3' 
regions of the promoter, respectively. The 5' and 3' products generated from the 
respective PCR reactions were then pooled at a 1:1 ratio and re-amplified 
using primers targeting the 5' and 3' ends of the respective promoters. HRE mutants 
of the respective promoters were verified by sequencing (Microsynth) to confirm that 
the HRE mutation was correct and in the right orientation. 

shRNA knockdown of Khk-A and Khk-C. Khk-A- or Khk-C-specific shRNAs 
were designed using the Life Technologies BLOCK-iT RNAi Designer online tool 
(Life Technologies), searching for 21mer targets of either Khk-A- or Khk-C- 
specific transcripts. shRNAs were automatically compared to the mouse database 
using BLAST. As a loop sequence, the TRC-library loop sequence TCTGAG was 
used. Per isoform, three different shRNAs were chosen as promising, according to 
the program’s algorithm and cloned into the pLKO.1 lentiviral vector. The pres- 
ence of the shRNA was confirmed by sequencing (Microsynth). shRNAs were 
then tested in NMCs for knockdown efficiency and specificity for either Khk-A or 
Khk-C. After validation, shRNAs shKhk-A#3, referred to as shKhk-A, and 
shKhk-C#2, referred to as shKhk-C, were chosen for all experiments described. The 
sequences are as follows: shKhk-A, sense (5'–3'): 
CCGGGATGACCTCCGCCAACATTCTCTCGAGAGAATGTTGGCGGAGGTCATCTTTTTG; 
shKhk-C, sense (5'–3'): 
CCGGGCTCTTGCTGCATCGTCAACACTCGAGTGTTGACGATGCAGCAAGAGCTTTTTG. 

TRC lentiviral shRNA constructs. For lentiviral shRNA-mediated knockdown, 
the following shRNAs from the TRC library were used in a pLKO.1 construct: 
Sf3b1 (shSf3b1#1, TRCN0000295293; shSf3b1#2, TRCN0000295294), 
Sf3a3 (shSf3a3#1, TRCN0000285834; shSf3a3#2, TRCN0000277028), Sf3b2 
(shSf3b2#3, TRCN0000123811; shSf3b2#4, TRCN0000123812), Sf3b3 (shSf3b3#5, 
TRCN0000311577; shSf3b3#11, TRCN0000109223), Glut5 (shGlut5#4, 
TRCN0000079392; shGlut5#, TRCN0000079388), Hif1α (shHif1α, 
TRCN0000232220), Khk (shKhk, TRCN0000416511). As a non-targeting 
shRNA construct the pLKO.1 vector SHC002 (Sigma) was used. 

RNA isolation and reverse transcription. RNA was isolated with the RNeasy 
Mini Kit (Qiagen) and reverse-transcribed using an EcoDry Premix (OligoDT) 
(Clontech) according to the manufacturer’s instructions. Quantitative real-time 
PCR (qRT-PCR) reactions were prepared with Roche LightCycler 480 
SYBR Green I Master as recommended by the manufacturer and run on a 
LightCycler 480 (Roche). Ct values were normalized to the housekeeping 
gene Hprt1. Sequences of the qRT-PCR primers used in this study are as 
follows. Mouse: Hprt1 5'-GGGGCTGTACTGCTTAACCAG-3' and 
5'-TGCTCATCAGTTGCCACTTC-3'; Sf3b1 (primer pair A in Extended Data Fig. 3a) 
5'-GCATCGTCCTCCAAAGATTGC-3' and 5'-GCATTCATTTTAGGATCGGGAGT-3'; 
Sf3b1 (primer pair B in Extended Data Fig. 3a) 5'-CTCATGGGCTGTGCTATCTTGC-3' 
and 5'-GTTGCTGCCTCAGCCAAAGC-3'; Sf3b1 (codon optimized, primer pair C in 
Extended Data Fig. 3a) 5'-CGTTAAAGGCCATCGTGAACG-3' and 
5'-CAATTTTCCTGCACTTTCTCATGC-3'; 
Sf3a3 5'-GAACGGCTTATGGATGTTATGGC-3' and 5'-GGTGATCGGAATTGATCTGGTC-3'; 
Sf3b2 5'-CATCCCGAACCTCCCAAAGG-3' and 5'-GCTCCTATCTCTGCCAATCTGG-3'; 
Sf3b3 5'-ATTTTGGAGTTACTACGCCCAG-3' and 5'-GCCATAAGTGACCGGATAACAC-3'; 
Khk-A 5'-CCAACATTCTGTGGACTTACG-3' and 5'-CCTTCTCAAAGTCCTTAGCAG-3'; 
Khk-C 5'-GCTGACTTCAGGCAGAGG-3' and 5'-CCTTCTCAAAGTCCTTAGCAG-3'; 
Glut-1 5'-CAGTTCGGCTATAACACTGGTG-3' and 5'-GCCCCCGACAGAGAAGAT-3'; 
Pdk1 5'-GGACTTCGGGTCAGTGAATGC-3' and 5'-CGCAGAAACATAAACGAGGTC-3'; 
Pkm2 5'-CGCCTGGACATTGACTCTG-3' and 5'-GAAATTCAGCCGAGCCACATT-3'; 
Glut5 5'-CCAATATGGGTACAACGTAGCTG-3' and 5'-CCAATATGGGTACAACGTAGCTG-3'; 
sk.α-Actin 5'-TATGTGGCTATCCAGGCGGTG-3' and 5'-CCCAGAATCCAACACGATGC-3'; 
Nppa 5'-GTGCGGTGTCCAACACAGAT-3' and 5'-TCCAATCCTGTCAATCCTACCC-3'; 
Nppb 5'-AGTCCTTCGGTCTCAAGGCA-3' and 5'-CCGATCCGGTCTATCTTGTGC-3'; 
αMhc 5'-GCCCAGTACCTCCGAAAGTC-3' and 5'-GCCTTAACATACTCCTTGTC-3'; 
βMhc 5'-GCATTCTCCTGCTGTTTCCTT-3' and 5'-TGGATTCTCAAACGTGTCTAGTGA-3'; 
human: HPRT1 5'-CCTGGCGTCGTGATTAGTGAT-3' and 5'-AGACGTTCAGTCCTGTCCATAA-3'; 
SF3B1 5'-GTGGGCCTCGATTCTACAGG-3' and 5'-GATGTCACGTATCCAGCAAATCT-3'; 
KHK-A 5'-GCTATTCTGTGGACCTACGCT-3' and 5'-CAATGTGGATCCACTTGAACTG-3'; 
KHK-C 5'-CTCCTGCTGCATCATCAAC-3' and 5'-CAATGTGGATCCACTTGAACTG-3'; 
GLUT1 5'-GGCCAAGAGTGTGCTAAAGAA-3' and 5'-ACAGCGTTGATGCCAGACAG-3'; 
PDK1 5'-CTGTGATACGGATCAGAAACCG-3' and 5'-TCCACCAAACAATAAAGAGTGCT-3'; 
GLUT5 5'-TCCAGTATGGGTACAACGTGG-3' and 5'-CATGGACACGGTTACAGACCA-3'; 
AKR1B1 5'-TTTTCCCATTGGATGAGTCGG-3' and 5'-CCTGGAGATGGTTGAAGTTGG-3'; 
NPPA 5'-CAACGCAGACCTGATGGATTT-3' and 5'-AGCCCCCGCTTCTTCATTC-3'; 
NPPB 5'-TGGAAACGTCCGGGTTACAG-3' and 5'-CTGATCCGGTCCATCTTCCT-3'; 
αMHC 5'-GCCCTTTGACATTCGCACTG-3' and 5'-CGGGACAAAATCTTGGCTTTGA-3'; 
βMHC 5'-ACTGCCGAGACCGAGTATG-3' and 5'-GCGATCCTTGAGGTTGTAGAGC-3'. 
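
The text states only that Ct values were normalized to Hprt1. One common way to turn such normalized values into a fold change is the 2^(-ΔΔCt) calculation sketched below in Python. The choice of this method, the assumption of roughly 100% amplification efficiency and the function name `relative_expression` are illustrative assumptions rather than details given in the source.

```python
# Minimal sketch of a 2^(-ddCt) fold-change calculation (assumed method).
# Assumes ~100% PCR efficiency, i.e. product doubles with every cycle.


def relative_expression(ct_target: float, ct_reference: float,
                        ct_target_ctrl: float, ct_reference_ctrl: float) -> float:
    """Fold change of a target gene in a sample relative to a control sample.

    ct_target / ct_reference: Ct of target and housekeeping gene (e.g. Hprt1)
    in the treated sample; *_ctrl: the same two values in the control sample.
    """
    delta_sample = ct_target - ct_reference          # dCt, treated sample
    delta_control = ct_target_ctrl - ct_reference_ctrl  # dCt, control sample
    return 2.0 ** -(delta_sample - delta_control)    # 2^(-ddCt)


if __name__ == "__main__":
    # Made-up Ct values for illustration only.
    print(relative_expression(24.0, 20.0, 26.0, 20.0))
```

With the made-up values shown, the target is detected two cycles earlier in the treated sample after normalization, which corresponds to a fourfold increase under the stated efficiency assumption.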
Splice-junction microarrays. Total RNA was isolated from NMCs infected with 
shNs and shHif1α lentiviruses and cultured in either 21% or 3% O2. In another 
experiment NMCs were transduced with shNs and shSf3b1 lentiviruses and 
cultured at 3% O2 before RNA was isolated. Complementary DNA samples were 
prepared using the GeneChip WT cDNA Synthesis and Amplification Kit 
(Affymetrix). cDNA was hybridized to AltSplice splice-junction microarrays 
and resulting data were analysed with version 3 of ASPIRE software as previously 
described. The unprocessed CEL files with microarray data are available from 
ArrayExpress with accession numbers E-MTAB-2652 and E-MTAB-3203, 
respectively. 
Luciferase assays. The luciferase assay was performed from material of NMCs 
using a luciferase kit (Promega) as recommended by the manufacturer and ana- 
lysed on the MicroLumatPlus LB 96V (Berthold Technologies). 
Chromatin and RNA immunoprecipitations. ChIP assays were performed with 
material from NMCs using the ChIP-IT Kit (Active 
Motif) according to the manufacturer’s instructions. ChIP was performed with a 
ChIP-grade antibody from Abcam. In silico promoter analyses were performed 
using MatInspector and DiAlignTF (Genomatix). Sequences of primers used are 
5'-TTTCGCTGGCTTTTTAAGGA-3' and 5'-ATCTTCGCCATTTTGTCCAC-3'. 
RNA immunoprecipitation assays were performed with material from NMCs 
using an RNA-ChIP-IT kit from Active Motif as recommended by the manufacturer. 
RNA immunoprecipitation was performed using an Sf3b1 antibody from 
MBL International. Sequences of primers used to detect potential binding of 
Sf3b1 on the branch points of Khk-A and Khk-C are the following: branch point 
primer for Khk-A, 5'-CTACTTCCCTCTGCTCAGCC-3' and 
5'-GGTCATCCAGGACAAAATGC-3'; branch point primer for Khk-C, 
5'-GGACCCTTCTTCTCTGAGGC-3' and 5'-CACTTGAGACACATCCACGC-3'. 
Immunoblotting. Dissected hearts and NMC lysates were homogenized by 
freeze slamming and solubilized in a modified SDS sample buffer, sonicated 
and boiled for 5 min. Protein lysates were resolved on 8%, 10% or 15% 
polyacrylamide minigels (BioRad) and transferred onto nitrocellulose membrane (GE 
Healthcare) by wet or semi-dry transfer. The following antibodies were used for 
immunoblotting: rabbit HIF1α (catalogue no. NB-100-47; Novus Biologicals), 
rabbit SF3B1 (catalogue no. ab172634; Abcam), rabbit KHK (catalogue no. 
HPA007040; Sigma), rabbit KHK-C (custom made by Eurogentec), rabbit 
Cre (catalogue no. 69050; Novagen), rabbit SF3A3 (catalogue no. ab140605), 
mouse SF3B2 (catalogue no. ab56800), rabbit SF3B3 (catalogue no. 
NBP2-20258), mouse α-actinin (catalogue no. A7811; Sigma) and chicken GFP 
(catalogue no. GFP-1020; Aves Labs). Immunodetection and visualization of 
signals by chemiluminescence was carried out as described, or by using the 
ImageQuant LAS 4000 mini (GE Healthcare). Immunoblots for Khk-A/C and 
Khk-C were visualized using SuperSignal West Pico Chemiluminescent Substrate 
(Thermo Scientific). 
Ketohexokinase splicing assay I. This Khk splicing assay, including amplification 
of part of the Khk cDNA and the identified alternatively spliced region, was adapted 
from a splice assay for PKM1/M2 isoforms. RNA isolated from NMCs transduced 
with shNs or shSf3b1 lentiviruses after treatment with isoproterenol or 
HIF1αΔODD and RNA isolated from sham- or 1K1C-treated wild-type 
left ventricles was reverse-transcribed and then amplified using a forward primer 
(5'-GAAGCAGATCCTGTGCGTG-3') that bound to shared Khk A/C exon 1 
and a reverse primer (5'-ATTGTGCTCCTCTATCCGC-3') that bound to 
shared Khk A/C exon 5 sequences. The 468-bp PCR products representing both 
Khk-A and Khk-C transcripts were then digested with HincII and run on a 1.5% 
agarose gel, after which the 468 bp (uncut Khk-A-specific amplicon), 294 bp and 
174 bp (HincII-cleaved Khk-C-specific amplicon) products were quantitated 
using ImageJ software. 
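
The text does not spell out how band intensities were converted into an isoform ratio. A minimal Python sketch of one plausible calculation follows: the two HincII cleavage products (294 bp and 174 bp, which together span the 468-bp amplicon) are summed and expressed as a fraction of the total signal. The formula, the function name `khk_c_fraction` and the example intensities are assumptions for illustration.

```python
# Hypothetical quantification of the HincII splicing assay from band
# intensities (e.g. background-subtracted ImageJ densitometry values).


def khk_c_fraction(uncut_468: float, cut_294: float, cut_174: float) -> float:
    """Fraction of amplicon signal attributable to Khk-C.

    uncut_468: intensity of the HincII-resistant 468-bp band (Khk-A).
    cut_294, cut_174: intensities of the two cleavage products (Khk-C).
    Because 294 + 174 = 468, the summed cut bands carry the same DNA mass
    per molecule as the uncut band, so a mass-based stain can be compared
    directly without a length correction.
    """
    cut_total = cut_294 + cut_174
    total = uncut_468 + cut_total
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    return cut_total / total


if __name__ == "__main__":
    # Made-up intensities for illustration only.
    print(khk_c_fraction(300.0, 120.0, 80.0))
```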
Ketohexokinase splicing assay II. To analyse KHK isoforms in human and 
mouse hearts or NMCs RT-PCR was performed. Total RNA was isolated from 
hearts or NMCs and reverse-transcribed into cDNA using the RNA to cDNA 
EcoDry Premix (OligoDT) kit (Clontech). Full length KHK-A/C cDNA was amp- 
lified by RT-PCR, loaded onto an agarose gel and purified by gel extraction. One 
hundred nanograms of full-length KHK-A/C cDNA was used as a template to 
amplify KHK-A and KHK-C specific fragments, respectively, using primer pairs 
consisting of a common KHK A/C forward and KHK-A- or KHK-C-specific 
reverse primer set. To be able to distinguish between KHK-A and KHK-C iso- 
forms, the corresponding amplicons differed in length. The influence of fragment 
length on fragment amplification was minimised by designing two sets of primer 
pairs for each isoform (primer set 1 vs primer set 2): Khk-A1 < Khk-C1 and 
Khk-A2 > Khk-C2. The primer sequences are the following: primer pair 1 (human): 
KHK-A/C 5'-GAGAAGCAGATCCTGTGC-3', KHK-A 5'-AGCGTAGGTCCACAGAATAGC-3' 
and KHK-C 5'-GGAGTTGTTGATGATGCAGC-3'; 
primer pair 2 (human): KHK-A 5'-GCTATTCTGTGGACCTACGCT-3' and 
KHK-C 5'-CAGTTCAAGTGGATCCACATTG-3'; KHK-A/C 5'-GGTGTTGTGTGCGTCTATCC-3'; 
primer pair 1 (mouse): Khk-A/C 5'-GAAGCAGATCCTGTGCGTG-3', 
Khk-A 5'-GGTCTGAAGGACCACATATCG-3' and Khk-C 5'-CATTGGAGTTGTTGACGATGC-3'; 
primer pair 2 (mouse): Khk-A 5'-CATTCTGTGGACTTACGATATGTG-3' and 
Khk-C 5'-GTACCATTATACTCTACGACACGAAC-3'; Khk-A/C 5'-GGTGTTGTGTGCGTCTATCC-3'. 
Ketohexokinase activity assay. The ketohexokinase activity assay was modified 
by adding Roche Complete Protease Inhibitor Cocktail and Roche PhosSTOP to 
the reaction mixture, adding 140 µg NMC protein per reaction and incubating 
for 2 h at 37 °C. Spectrophotometric absorbance measurements of the 
remaining fructose were performed in 96-well plates at 515 nm using a Spectra 
MAX 190 plate reader (Molecular Devices). Wild-type liver protein was used as a 
positive control and reactions without protein were used as negative controls. 
Metabolomics. 4 × 10⁵ NMCs were cultured per 3-cm dish. Cells were washed 
with PBS and whole-cell culture plates were snap frozen on dry ice. Metabolites 
were extracted twice from plates with hot (75 °C) 70% ethanol. Targeted analysis 
of hexose-phosphates levels and of their '*C-labelling pattern was performed 
by ion-pairing, reverse phase, ultra-performance liquid chromatography- 
tandem mass spectrometry on a Thermo Quantum Ultra instrument (Thermo 
Scientific). Technical quadruplicates were merged using their mean. 
Untargeted analysis of metabolites was performed by flow injection-time-of-flight 
mass spectrometry on an Agilent 6550 instrument in negative mode (4 GHz, 
high resolution) in an m/z range of 50–1,000 as reported earlier. Ions were 
annotated to ionic derivatives of the metabolites listed in the KEGG database 
(organism hsa) using 0.001 atomic mass units (a.m.u.) tolerance. Data processing and 
analysis was done with Matlab. 
Immunofluorescent stainings. Immunofluorescent stainings were performed as 
described previously. Cells were permeabilized and incubated with sarcomeric 
α-actinin antibody (Sigma) diluted 1:500 in 2% horse serum, 0.025% Triton 
X-100/PBS overnight at 4 °C. Cells were washed with PBS and incubated with 
4',6-diamidino-2-phenylindole (DAPI), Phalloidin and AlexaFluor488 anti-mouse 
secondary antibody for 1 h at room temperature. Afterwards, stainings were 
washed and mounted in Lisbeth medium. 
[³H]leucine incorporation assay. Cardiomyocyte hypertrophy was evaluated by 
measuring [³H]leucine incorporation into newly synthesized proteins. Cells 
were counted in a Neubauer chamber after trypsinization and scintillation counts 
were normalized to absolute cell number. 
Ex vivo [³H]fructose/[¹⁴C](deoxy)-glucose uptake. Animals were euthanized 
8 weeks after surgery and hearts were briefly placed into 1 M KCl solution. 
The left ventricles were cut into equal tissue pieces, dried on tissue, weighed 
and placed into 6-well plates with 2 ml medium (serum-free DMEM, 5 mM 
glucose and 8 µM fructose). Tissue explants were incubated in the same medium 
containing 1 mCi ml⁻¹ [³H]fructose (specific activity 5 Ci mmol⁻¹) and 0.1 mCi 
ml⁻¹ [¹⁴C]deoxy-D-glucose (specific activity 58 mCi mmol⁻¹) and harvested at 
the respective time points. Harvested explants were washed two times with PBS and 
dissolved in 0.5 ml 1 M KOH (70 °C, 10 min, shaking). 5 ml scintillation liquid 
was added and radioactivity was measured in a Beckman LS6500 scintillation 
counter. The radioactivity counts were normalized to tissue weight. 

In vivo [³H]fructose/[¹⁴C](deoxy)-glucose uptake and biodistribution. A mixture 
of 20 µl 1 mCi ml⁻¹ [³H]fructose (specific activity 5 Ci mmol⁻¹) and 200 µl 
0.1 mCi ml⁻¹ [¹⁴C]deoxy-D-glucose (specific activity 55 mCi mmol⁻¹) was 
applied to sham- or TAC-treated C57/BL6J mice at 3 weeks post operation by 
oral gavage. After 45 min, animals were weighed and euthanized. Denoted organs 
were collected and weighed. Tissue was digested with 4 M ethanolic KOH, radioactivity 
was measured in a Beckman LS6500 scintillation counter (Beckman) and 
calculated as disintegrations per minute (d.p.m.) per mg tissue weight. 
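
To make the units in this protocol concrete, the Python sketch below converts the administered tracer volumes into activity (using the standard relation 1 µCi = 2.22 × 10⁶ d.p.m.) and normalizes a tissue count to tissue weight. The function names and the example tissue values are hypothetical and serve only as a worked illustration of the arithmetic.

```python
# Worked unit arithmetic for the tracer doses and d.p.m.-per-mg read-out.

DPM_PER_MICROCURIE = 2.22e6  # 1 uCi = 3.7e4 decays per second x 60 s


def dose_dpm(volume_ul: float, activity_mci_per_ml: float) -> float:
    """Administered activity in d.p.m.

    volume (ul) x concentration (mCi/ml) gives uCi directly, because
    1 ul = 1e-3 ml and 1 mCi = 1e3 uCi.
    """
    dose_uci = volume_ul * activity_mci_per_ml
    return dose_uci * DPM_PER_MICROCURIE


def dpm_per_mg(tissue_dpm: float, tissue_weight_mg: float) -> float:
    """Normalize a scintillation count (d.p.m.) to tissue weight (mg)."""
    if tissue_weight_mg <= 0:
        raise ValueError("tissue weight must be positive")
    return tissue_dpm / tissue_weight_mg


if __name__ == "__main__":
    # Doses described in the text: 20 ul at 1 mCi/ml and 200 ul at 0.1 mCi/ml.
    print(dose_dpm(20, 1.0), dose_dpm(200, 0.1))
    # Made-up tissue reading: 12,000 d.p.m. in a 120-mg sample.
    print(dpm_per_mg(12000.0, 120.0))
```

Both tracers therefore correspond to the same nominal activity of 20 µCi per animal.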

Fructose content. Heart fructose content was measured using the fructose 
colorimetric/fluorometric assay kit according to the manufacturer’s instructions 
(Biovision) and as described previously. Values were normalized to protein 
content. 

[³H]fructose incorporation into RNA, DNA and protein. 4 × 10⁵ NMCs were 
seeded in 3-cm dishes and transduced in quadruplicate. On day 3, NMCs were 
incubated in maintenance medium containing 5 mM glucose, 8 µM unlabelled 
fructose and 1 mCi ml⁻¹ [³H]fructose (specific activity 5 Ci mmol⁻¹) overnight. 
NMCs were washed three times with PBS before cell lysis was performed with 
0.7 ml TRIzol (Invitrogen). Fractionation into RNA, DNA and protein was 
performed according to the manufacturer’s instructions with adaptations of the 
volumes according to the starting volume. 

ADP/ATP assay. 5 × 10⁴ NMCs were cultured in a 96-well format and treated as
denoted. After 3 days in culture, NMCs were analysed using a Bioassay Systems
Enzylight ATP:ADP ratio kit (ELDT-100, Bioassay Systems) according to the
manufacturer’s instructions. Cellular ATP from 10 µl of lysate reacted with
D-luciferin to generate a light signal that was proportional to the
intracellular ATP concentration. In a second step, ADP was converted to ATP by
an enzymatic reaction, and the newly formed ATP reacted with D-luciferin as in
the first step. Luminometry was analysed using a MicroLumatPlus LB 96V
(Berthold Technologies).
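
A two-step luminescence readout of this kind is usually reduced to a single ratio per well. The sketch below assumes three readings per well, a scheme common to kits of this type but not spelled out in the text: the ATP signal from the first step, the residual signal just before ADP conversion, and the signal after ADP conversion. The function and argument names are hypothetical.

```python
def adp_atp_ratio(rlu_atp, rlu_residual, rlu_after_conversion):
    """ADP/ATP ratio from three luminescence readings (relative light units).

    rlu_atp              -- first-step signal, proportional to cellular ATP
    rlu_residual         -- signal remaining just before ADP is converted
    rlu_after_conversion -- signal after ADP has been converted to ATP
    """
    if rlu_atp <= 0:
        raise ValueError("ATP signal must be positive")
    # Only the signal gained by converting ADP to ATP counts as ADP
    return (rlu_after_conversion - rlu_residual) / rlu_atp


# Hypothetical well: 10,000 RLU for ATP, 1,000 RLU residual, 3,000 RLU after conversion
example_ratio = adp_atp_ratio(10000.0, 1000.0, 3000.0)
```

Subtracting the residual reading keeps leftover first-step signal from being counted as ADP.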

Myocardial ATP content. The level of myocardial ATP was measured by using a 
colorimetric/fluorometric assay kit (Abcam) according to the manufacturer’s 
instructions. Left-ventricular samples of about 10 mg weight were used as a 
starting material. The absorbance was read at 550 nm using a Tecan Infinite 
M1000 Pro plate reader and ATP content was calculated based on a standard 
curve generated from ATP-standards on the same 96-well plate. Values were 
normalized to protein concentrations obtained from Bradford assays. 
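
The standard-curve calculation described above can be illustrated with an ordinary least-squares line through the standards, followed by normalization to protein. This is a minimal sketch: the standard amounts, absorbance values and function names are invented for illustration and are not taken from the kit or the study.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def atp_per_mg_protein(absorbance, slope, intercept, protein_mg):
    """Read ATP (nmol) off the standard curve and normalize to protein (mg)."""
    atp_nmol = (absorbance - intercept) / slope
    return atp_nmol / protein_mg


# Hypothetical ATP standards (nmol per well) and their absorbance at 550 nm
standards_nmol = [0.0, 2.0, 4.0, 6.0, 8.0]
standards_a550 = [0.05, 0.25, 0.45, 0.65, 0.85]
slope, intercept = fit_line(standards_nmol, standards_a550)

# Hypothetical sample: absorbance 0.55 with 0.5 mg protein in the well
sample_atp = atp_per_mg_protein(0.55, slope, intercept, 0.5)
```

Fitting the standards from the same plate as the samples, as the text describes, means plate-to-plate differences in signal are absorbed by the curve rather than by the samples.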

[³H]fructose incorporation into lipids. 4 × 10⁵ cells were seeded in 3-cm
dishes and transduced in quadruplicate. On day 3, cells were loaded with
0.25 mM oleic acid overnight and 1 µCi ml⁻¹ [³H]fructose (specific activity
5.0 Ci mmol⁻¹) was added to the medium. On day 4, cells were washed twice with
PBS and lipids were extracted by hexane-isopropanol extraction*®. Protein
concentration was measured by BCA protein assay and the radioactivity count
was normalized to protein content.

Oil Red O staining. Oil Red O staining was combined with immunofluorescence
according to a protocol in ref. 47. Spot detection was performed with the
Anchor image analysis platform (http://www.microscopyimages.org/). Nuclei are
segmented by first thresholding the (Gaussian-blurred) DAPI channel with
Otsu’s method, followed by a watershed segmentation on the inverted Euclidean
distance transform. Neighbouring segments are merged if merging increases an
ellipticity score. A merge occurs if ell(xy) > 0.5[ell(x) + ell(y)], where xy
is the merge of segments x and y. Segments with an area >50 µm² are deemed
nuclei. Cells are segmented by thresholding the histogram-stretched FITC
channel, combining the result with the binary image of the nuclei, and
performing ultimate erosion using the segmented nuclei as seeds. Spots are
segmented by first performing rolling-ball background correction and
thresholding, followed by a watershed segmentation on the inverted intensity
after median filtering. Segments with an area >0.2 µm² and a minimum gradient
signal are deemed spots.
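
Two of the steps above, the Otsu threshold and the merge rule, are simple enough to sketch in plain Python. This is not the Anchor platform's code: the histogram-based Otsu implementation is a generic textbook version, the ellipticity scores are passed in as numbers because the text does not define ell(), and all function names are hypothetical.

```python
def otsu_threshold(pixels, bins=256):
    """Threshold that maximizes the between-class variance of a histogram."""
    lo, hi = min(pixels), max(pixels)
    if lo == hi:
        return lo
    width = (hi - lo) / bins
    hist = [0] * bins
    for p in pixels:
        hist[min(int((p - lo) / width), bins - 1)] += 1

    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    weight_bg, sum_bg = 0, 0.0
    best_var, best_bin = -1.0, 0
    for i, h in enumerate(hist):
        weight_bg += h
        if weight_bg == 0:
            continue
        weight_fg = total - weight_bg
        if weight_fg == 0:
            break
        sum_bg += i * h
        mean_bg = sum_bg / weight_bg
        mean_fg = (sum_all - sum_bg) / weight_fg
        var_between = weight_bg * weight_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_bin = var_between, i
    # Upper edge of the best background bin, in original intensity units
    return lo + (best_bin + 1) * width


def should_merge(ell_x, ell_y, ell_xy):
    """Merge rule from the text: ell(xy) > 0.5 * [ell(x) + ell(y)]."""
    return ell_xy > 0.5 * (ell_x + ell_y)


def is_nucleus(area_um2, min_area_um2=50.0):
    """Area filter from the text: segments larger than 50 µm² count as nuclei."""
    return area_um2 > min_area_um2


# Hypothetical bimodal intensities: dark background and bright nuclei
intensities = [10] * 50 + [200] * 50
threshold = otsu_threshold(intensities)
foreground = [p for p in intensities if p > threshold]
```

The merge rule accepts a merged segment only when it is more elliptical than the average of its two parts, which favours rejoining nuclei that the watershed step split in two.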


Nile red staining. 4 × 10⁵ NMCs were incubated with 0.25 mM oleic acid
(Sigma-Aldrich) overnight and stained with 10 µg ml⁻¹ Nile red
(Sigma-Aldrich) for 10 min in PBS**. Values were normalized to cell number.

Cell size quantification. After 3 days of culture, NMCs were stained for
sarcomeric α-actinin (1:500, Sigma), phalloidin and DAPI. Pictures of all
channels were taken at 20× magnification. Using the software CellProfiler,
sarcomeric α-actinin-positive cells were identified and cell area was
quantified. Multinucleated cells were counted manually and the average area
per cell was corrected for the number of multinucleated cells.

Measurements of extracellular acidification and oxygen consumption rate.
1 × 10⁵ NMCs per well were seeded in a 24-well Seahorse cell culture plate
(SeaHorse Bioscience) and assayed at day 3. ECAR and OCR were measured in
parallel using a SeaHorse XF24 Flux Analyzer (SeaHorse Bioscience) in NMCs
incubated in low-buffered Krebs-Henseleit buffer, supplemented with glucose
(5 mM) and fructose (25 µM). After five consecutive basal measurements of
ECAR to measure glycolysis-related lactate production, 0.25 mM oleic acid was
added and oxygen consumption rates were measured again five consecutive
times. Finally, 1 µM carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone was
injected to stimulate maximal OCR in NMCs, reflecting mitochondrial reserve
capacity.

Statistical analysis. For statistical analyses, unpaired, two-tailed Student’s
t-tests (Excel) or one-way ANOVA followed by Dunnett’s multiple comparison
post-test were used as indicated in the respective figure legends. No
statistical methods were used to predetermine sample size.
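
For reference, the test statistic behind an unpaired Student's t-test with pooled variance can be written out in a few lines. This is a generic sketch, not the authors' Excel workflow; the function name and the example values are hypothetical, and the two-tailed P value would still have to be read from the t distribution with the returned degrees of freedom.

```python
import math


def students_t(group_a, group_b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    mean_a = sum(group_a) / n_a
    mean_b = sum(group_b) / n_b
    ss_a = sum((x - mean_a) ** 2 for x in group_a)
    ss_b = sum((x - mean_b) ** 2 for x in group_b)
    df = n_a + n_b - 2
    pooled_variance = (ss_a + ss_b) / df
    standard_error = math.sqrt(pooled_variance * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / standard_error, df


# Hypothetical relative expression values for two groups of three replicates
t_value, degrees_of_freedom = students_t([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])
```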
Extended Data Figure 1 | HIF1α induces SF3B1 gene expression that
mediates KHK-C isoform expression. a, Schematic experimental workflow.
b, Comparison of C1QBP, HNRNPH3, JMJD6 and SF3B1 promoter sequences
of human and mouse illustrates putative HREs up to 1,000 bp upstream of
the transcription start site (TSS). The core consensus HRE motif is shown in
colour. c, d, qPCR (c) and immunoblotting (d) for Sf3b1 expression in NMCs
transduced with scrambled shNs viruses or viruses bearing two distinct
shRNAs against Sf3b1. All values obtained by qPCR are presented in relation to
Sf3b1 expression in shNs (set as 1.0) (n = 3 biological replicates per group).
e, RT-PCR analysis of Khk mRNA as in Fig. 1g in NMCs transduced as
indicated. f, RT-PCR analyses of Khk mRNA from NMCs infected as indicated
and stimulated with mock or isoproterenol (ISO). PCR products were digested
with HincII as in Fig. 1c. Splicing is quantified as percent of the Khk-C
isoform (%Khk-C). g, RT-PCR-based assessment of Khk-A and Khk-C isoform
expression as done in Fig. 1g. h, i, Ventricular samples of sham- or
1K1C-operated mice were analysed for Khk isoform expression (h) as done in
Fig. 1g or Sf3b1, Khk-A and Khk-C mRNA expression (i) by qPCR; n = 6 for each
group. j, k, Ventricular samples of sham- or TAC-treated mice (j) and NaCl- or
isoproterenol-treated mice (k) were analysed for expression of Sf3b1, Khk-A
and Khk-C mRNA by qPCR (n = 6 for each individual group).
l, NMCs transduced as specified were validated for protein expression of
indicated proteins by using immunoblots. In addition, lysates of NMCs
overexpressing KHK-A or KHK-C were processed for immunoblotting with
Khk antibodies. m, n, Ventricular samples of sham- or TAC-treated (m) and
sham- or isoproterenol-treated mice (n) were analysed for indicated proteins
by immunoblotting. o, Samples from left ventricles of 1K1C-, TAC- or
isoproterenol-treated mice and their corresponding controls were analysed for
mRNA expression of Hif1α targets. All values are presented in relation to
sham-operated controls set as 1.0 (n = 6 for each individual group). p, q,
Left-ventricular biopsies of HCM patients and healthy controls were analysed
for mRNA expression of SF3B1, KHK-A and KHK-C (p) and HIF1α targets
(q). All values are presented in relation to healthy controls set as 1.0 (n = 6
for controls and n = 16 for patient samples). Error bars are s.d. (c) or s.e.m.
(i-k, o-q). *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired t-test
(c, i-k, o-q).
Extended Data Figure 2 | SF3B1 is a HIF1α target gene whose product
regulates growth and metabolic shift in vitro. a, NMCs infected as denoted
were stimulated with mock, phenylephrine or isoproterenol. Sf3b1 mRNA
expression was examined by qPCR (n = 3 biological replicates). b, NMCs were
transduced as indicated and processed for qPCR analysis of Sf3b1 mRNA
(n = 3 biological replicates). c, Comparative analysis of a conserved HRE
(shown in red) in the promoter of SF3B1 of diverse mammalian species located
122 bp downstream of the transcription start site (TSS). d-g, Immunoblot
detection of indicated proteins in NMCs transduced with shNs and shSf3b1
lentiviruses and treated with phenylephrine or isoproterenol (d), co-infected
with empty vector or HIF1αΔODD (e), cultured under normoxic (21% O₂) or
hypoxic (3% O₂) conditions (f) or co-transduced with shVhl lentiviruses (g).
h-o, Evaluation of [³H]leucine incorporation and the myosin heavy chain α/β
mRNA expression ratio in NMCs treated with either phenylephrine or
isoproterenol (h, l), HIF1αΔODD overexpression (i, m), hypoxia (j, n) or pVhl
deletion (k, o), with or without shSf3b1-mediated Sf3b1 depletion. Obtained
values are shown relative to incorporated radioactivity in control (shNs)
NMCs (set as 1.0) (n = 4 per individual group).
p, NMCs treated as in i were microscopically analysed for cell size using
CellProfiler. Data represent 1 of 3 independent experiments with
approximately 200 analysed cells per experiment. q, Lysates of NMCs treated
as indicated were assessed for protein expression of specified proteins by
immunoblots. r, Evaluation of [³H]leucine incorporation of NMCs infected as
indicated. Obtained values are shown relative to incorporated radioactivity in
control (shNs/empty) NMCs (set as 1.0) (n = 3 per individual group).
s-z, Assessment of extracellular acidification rate (ECAR), and oleic acid- and
carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone (FCCP)-induced
oxygen consumption rate (OCR) in NMCs transduced and treated as
indicated. Highlighted ECAR measurements were done at baseline. All data
are compared to shNs-transduced NMCs (n = 8 biological replicates in
the group shNs + isoproterenol; n = 10 biological replicates in all other
groups). Error bars are s.d. (a, b, h-o, r) or s.e.m. (p, s-z). *P < 0.05;
**P < 0.01; ***P < 0.001. Two-tailed unpaired t-test (b, t, u, w-z). One-way
ANOVA followed by Dunnett’s multiple comparison post-test (a, h-p, r).
Extended Data Figure 3 | Influence of U2snRNP-complex members on
KHK isoform expression and validation of KHK-C-mediated fructose
uptake and metabolism. a, NMCs were transduced with either shNs or
shSf3b1 and transfected with codon-optimized Sf3b1 or empty construct and
expression of denoted proteins was assessed. b, NMCs were transduced with
either shNs or shSf3b1 and transfected with codon-optimized Sf3b1 or empty
construct, respectively. Exogenous Sf3b1 expression was evaluated by
qRT-PCR. Three different primer pairs were used. Primer pair A detects both
endogenous and exogenous codon-optimized Sf3b1 (top). Primer pair B
recognizes Sf3b1 close to the shRNA binding site and is specific for endogenous
Sf3b1 (left). Primer pair C is located in the codon-optimized region (right).
Values were normalized to shNs/empty-vector-infected NMCs (n = 3
biological replicates per group). c, [³H]leucine incorporation normalized to cell
number in NMCs treated as in b (n = 4 biological replicates per group).
d, NMCs transduced with empty vector or human SF3A3 or SF3B3 were
assessed for mRNA expression of Khk-A and Khk-C by qRT-PCR. Values were
normalized to empty-vector-transduced NMCs (n = 3 biological replicates per
group). e, NMCs transduced as in d were immunoblotted for indicated
proteins. f, NMCs transduced as indicated were probed for mRNA expression
of Khk-A or Khk-C by qRT-PCR. Values were normalized to shNs/empty-
vector-transduced NMCs (n = 3 biological replicates). g, NMCs treated with
either scrambled shRNA or two different shRNAs targeting either Sf3a3, Sf3b2
or Sf3b3 were co-transduced with empty vector or HIF1αΔODD. Protein
expression of Khk-A/C or Khk-C specifically was evaluated by
immunoblotting. h, Evaluation of [³H]leucine incorporation of NMCs infected
as in d normalized to empty-vector-transduced NMCs (n = 4 biological
replicates per group). i, Lysates of NMCs treated as indicated were
immunoblotted for specified proteins. j, KHK activity in lysates of NMCs
transduced with a combination of lentiviruses expressing either shKhk
targeting mouse Khk and human KHK-A or shKhk and human KHK-C or a
scrambled shRNA control together with an empty vector. k, Evaluation of the
kinetics of ['°C]fructose-1-phosphate accumulation in NMCs treated as in
i at the respective time points (n = 3 biological replicates). l, Glut5 mRNA
expression in NMCs transduced as indicated (n = 3 biological replicates per
group). m, NMCs transduced as indicated were microscopically analysed for
cell size. Data represent 1 of 3 experiments with approximately 50 analysed
cells per experiment. n, Khk-A- and Khk-C-specific shRNAs were evaluated
for their potential to inhibit Khk-A and Khk-C isoform expression,
respectively. NMCs were infected with shNs, shKhk-A or shKhk-C lentiviruses.
Data are shown in relation to shNs NMCs (set to 1.0) (n = 3 biological
replicates). Error bars are s.d. (b-d, f, h, k, l, n) or s.e.m. (m). *P < 0.05;
**P < 0.01; ***P < 0.001. One-way ANOVA followed by Dunnett’s multiple
comparison post-test (b-d, f, k, l, m, n).
Extended Data Figure 4 | Downstream metabolic effects of fructose
metabolism in NMCs. a, log₂ fold change of metabolites in NMCs
overexpressing KHK-C and cultured with glucose or glucose and fructose
compared to the corresponding control-transduced NMCs (n = 4 biological
replicates per group). b, Incorporation of [³H]fructose into RNA (top), DNA
(middle) and protein (bottom) of NMCs treated as indicated (n = 4 biological
replicates per group). c, Uptake of [¹⁴C]deoxyglucose in NMCs infected as in
b at depicted time points (n = 4 biological replicates per group). d, [³H]leucine
incorporation in NMCs transduced with shNs or shKhk and co-transduced
with either empty overexpression vector or KHK-A or KHK-C, respectively.
NMCs were cultured in media with increasing fructose concentrations under
physiologic glucose amounts or with increasing glucose concentrations under
physiologic fructose amounts. Data are presented relative to shNs/empty-
vector-transduced NMCs at 5 mM glucose/0 µM fructose (n = 4 biological
replicates). e, ADP/ATP ratios in NMCs transduced as indicated (n = 4
biological replicates, data show 1 of 3 representative experiments).
f, Immunofluorescence images of NMCs transduced with empty vector (upper
panel) or HIF1αΔODD (middle panel) lentiviruses or exposed to hypoxia
(lower panel) and additionally transduced as indicated. Prior to staining for
sarcomeric α-actinin, Oil Red O and DAPI, NMCs were incubated with
oleic acid. g, Quantification of lipid droplets/cell in NMCs of the
immunofluorescence images shown in f. h, Ratio of lipid droplets/cell area of
NMCs shown in g. i-l, NMCs were infected as indicated and processed for oxygen
consumption rate (OCR) and extracellular acidification rate (ECAR)
measurements under basal conditions or after injection of OA or FCCP.
Depicted are rates expressed as OCR to ECAR ratios (upper panels) or
individual ECAR (for measurements at baseline) or OCR values (n = 8
biological replicates for HIF1αΔODD/shKhk-A; n = 10 biological replicates
for all other groups). Error bars are s.d. (b-e) or s.e.m. (g-l). *P < 0.05;
**P < 0.01; ***P < 0.001. One-way ANOVA followed by Dunnett’s multiple
comparison post-test (b-e, g, h). Two-tailed unpaired t-test (i-l).
Extended Data Figure 5 | Fructose and glucose uptake ex vivo and
validation of AAV9-fl/fl-shSf3b1 viruses. a, Ratio of left-ventricular weight to
body weight of sham- or TAC-treated C57/Bl6J mice (n = 7 per group).
b, c, [³H]fructose (b) and [¹⁴C]deoxy-glucose (c) were co-applied by oral gavage
and biodistribution was measured at 45 min post injection (n = 7 per group).
d, Ratio of left-ventricular weight to tibia length (LV/TL) of sham- or 1K1C-
treated C57/Bl6J wild-type mice (n = 6 for sham and n = 7 for 1K1C-treated
mice). e, Fructose levels in samples from left ventricles of indicated mice as
determined by a quantitative colorimetric assay, normalized to protein amount
(n = 4 for sham and n = 5 for hearts of 1K1C-treated mice). f, g, Freshly
isolated left-ventricular pieces of sham- or 1K1C-treated mice incubated in
medium containing either [¹⁴C]fructose (f) or [¹⁴C]deoxyglucose (g) as
indicated were processed for uptake measurements. Counts were normalized to
tissue weight (n = 6 per time point and treatment). h-j, qPCR analyses of Glut5
mRNA expression in left ventricles of 1K1C- (h), TAC- (i) or isoproterenol-
treated (j) mice. All values are presented in relation to Glut5 expression in
shams (set as 1.0). n = 5 for sham-treated mice as controls for TAC-treated
mice (n = 6), all other groups n = 7. k, l, Relative mRNA expression of GLUT5
and aldose reductase (AKR1B1) in left-ventricular biopsies of patients
diagnosed with aortic stenosis (k) or HCM (l) in relation to healthy controls
(set as 1.0) was analysed by qPCR (n = 6 for controls; n = 16 for HCM and
n = 17 for patients with aortic stenosis). m, Integration of AAV9-fl/fl-
shSf3b1#1 viruses in the myocardium of Mlc2v-cre- or Mlc2v-cre+ mice was
examined 12 weeks after intravenous injection by confocal microscopy (using
20× magnification) of left-ventricular cryoslices stained against GFP and
sarcomeric α-actinin and counterstained for DAPI. n, Ventricular samples of
Mlc2v-cre- or Mlc2v-cre+ mice transduced with AAV9-fl/fl-shSf3b1#1 or
AAV9-fl/fl-shSf3b1#2 viruses were assessed for expression of indicated
proteins by immunoblotting. o, p, Ratio of left-ventricular weight to tibia length
(o) and ejection fraction (%EF) (p) of sham- or TAC-treated Mlc2v-cre-
and Mlc2v-cre+ mice injected with AAV9-fl/fl-shSf3b1#1 (number of mice
per group is given in Extended Data Table 1). q, Lysates of hearts from
sham- or TAC-operated Mlc2v-cre- and Mlc2v-cre+ mice transduced with
AAV9-fl/fl-shSf3b1#1 viruses were processed for immunoblotting with
antibodies as denoted. Error bars are s.e.m. (a-l, o, p). *P < 0.05; **P < 0.01;
***P < 0.001, two-tailed unpaired t-test (a-l, o, p).
Extended Data Figure 6 | Pathologic growth is dependent on HIF1α-driven
and SF3B1-mediated KHK-C expression. a-f, Ratio of left-ventricular weight
to tibia length (LV/TL) (a), mRNA expression of the hypertrophic markers
Nppa (b), myosin heavy chain β (βMhc) (c) and myosin heavy chain α (αMhc)
(d), ejection fraction (%EF) (e) and left-ventricular ATP content (f) of sham- or
TAC-treated Khk-A+/+ and Khk-A-/- mice. g-l, Ratio of left-ventricular
weight to tibia length (LV/TL) (g), mRNA expression of the hypertrophic
markers Nppa (h), myosin heavy chain β (βMhc) (i) and myosin heavy chain
α (αMhc) (j), ejection fraction (%EF) (k) and left-ventricular ATP content (l) of
Khk-A/C+/+ and Khk-A/C-/- mice. m, n, Lysates from ventricles of sham- or
TAC-operated Khk-A+/+ and Khk-A-/- mice (m) and Khk-A/C+/+ and
Khk-A/C-/- mice (n) were processed for immunoblotting with antibodies as
indicated. o-t, Assessment of hypertrophic marker gene expression (Nppa,
βMhc and αMhc) in ventricles of TAC- vs sham-treated Khk-A-/- mice and
their corresponding Khk-A+/+ controls (o-q) as well as Khk-A/C-/- mice
and their corresponding Khk-A/C+/+ controls (r-t). u, v, Mean blood pressure
(BP) in mmHg of Khk-A+/+ and Khk-A-/- (u) as well as Khk-A/C+/+ and
Khk-A/C-/- mice (v) 8 weeks after sham or 1K1C treatment. For a-l, o-v, the
number of mice per group is given in Extended Data Table 2. Error bars are
s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired t-test.
Extended Data Figure 7 | Model explaining how HIF1α activation of
SF3B1-dependent splicing of KHK regulates fructose and glucose
metabolism to promote cardiac hypertrophy in response to pathologic
stress. In this model, pathologic stress leads to increased expression of HIF1α
and HIF1α-dependent activation of genes encoding glycolytic enzymes and
the splicing factor SF3B1. SF3B1, in turn, assembles at the branch-point
sequence upstream of exon 3C of KHK pre-mRNA, leading to inclusion of
this exon and KHK-C protein production. This shift in isoform expression
from KHK-A to KHK-C in response to HIF1α-SF3B1 pathway activation
drives KHK-C-dependent fructose uptake via stimulation of GLUT5
expression and the conversion of fructose to fructose-1-phosphate (F1P), and
contributes simultaneously to the activation of glucose uptake and metabolism
through a yet-to-be-determined mechanism. The model holds that the
unrestrained conversion of fructose to F1P by KHK-C limits ATP levels,
thereby alleviating potential allosteric inhibition of phosphofructokinase (PFK)
by ATP to maintain a high glycolytic flux. F1P is further metabolized to
dihydroxyacetone phosphate (DHAP) and glyceraldehyde (GA). While DHAP
serves as a precursor for glycerol synthesis, GA can be further converted to
glyceraldehyde-3-phosphate (G3P), a key glycolytic intermediate. G3P can be
channelled into the non-oxidative pentose phosphate pathway (PPP),
supporting nucleic and amino acid biosynthesis. This metabolic constellation,
created by the activation of the HIF1α-SF3B1-KHK-C axis, increases
macromolecular biosynthetic capacity essential for hypertrophic growth,
steatosis and cardiac dysfunction. HKII and F-1,6-BP denote hexokinase II
and fructose-1,6-bisphosphate, respectively.
Extended Data Table 1 | Echocardiographic analysis of sham-, 1K1C- or TAC-treated Mlc2v-cre-/-cre+ mice injected with AAV9-fl/fl-shSf3b1 viruses.

                     Mlc2v-cre-, AAV fl/fl shSf3b1#1     Mlc2v-cre+, AAV fl/fl shSf3b1#1
                     sham           1K1C                 sham           1K1C
n                    4              3                    5              3
BW (g)               27.5 ± 0.6     27.3 ± 0.7           27.3 ± 0.6     27.8 ± 0.8
IVS;d (mm)           0.71 ± 0.03    1.00 ± 0.03**        0.70 ± 0.02    0.80 ± 0.04
IVS;s (mm)           0.98 ± 0.02    1.30 ± 0.05**        1.04 ± 0.09    1.08 ± 0.08
LVID;d (mm)          3.97 ± 0.08    4.13 ± 0.10          3.85 ± 0.11    4.09 ± 0.10
LVID;s (mm)          3.08 ± 0.10    3.30 ± 0.11          2.86 ± 0.15    2.98 ± 0.18
LVPW;d (mm)          0.69 ± 0.01    0.85 ± 0.03**        0.67 ± 0.02    0.82 ± 0.01
LVPW;s (mm)          0.91 ± 0.35    1.08 ± 0.12          0.93 ± 0.09    0.97 ± 0.04
%FS                  27.8 ± 1.9     25.3 ± 2.1           29.8 ± 1.6     27.1 ± 2.8
E/A                  2.21 ± 0.26    0.80 ± 0.06***       2172017        1.73 ± 0.11

                     Mlc2v-cre-, AAV fl/fl shSf3b1#1     Mlc2v-cre+, AAV fl/fl shSf3b1#1
                     sham           TAC                  sham           TAC
n                    5              4                    5              5
BW (g)               27.4 ± 0.8     28.0 ± 0.9           26.5 ± 0.5     26.9 ± 0.3
IVS;d (mm)           0.70 ± 0.03    0.98 ± 0.03***       0.68 ± 0.02    0.84 ± 0.04
IVS;s (mm)           0.96 ± 0.06    1.37 ± 0.04**        0.97 ± 0.05    1.18 ± 0.05
LVID;d (mm)          3.94 ± 0.06    4.04 ± 0.09          3.82 ± 0.06    3.68 ± 0.06
LVID;s (mm)          2.89 ± 0.13    2.80 ± 0.09          2.71 ± 0.13    2.54 ± 0.11
LVPW;d (mm)          0.68 ± 0.02    0.98 ± 0.04***       0.70 ± 0.02    0.82 ± 0.03
LVPW;s (mm)          0.97 ± 0.06    1.37 ± 0.10**        0.99 ± 0.07    1.11 ± 0.03
%FS                  30.4 ± 2.3     19.9 ± 3.9*          30.04 ± 3.8    34.2 ± 1.3
E/A                  1.62 ± 0.09    1.72 ± 0.23          1.68 ± 0.02    1.74 ± 0.12
Aortic Vel. (mm/s)   923 ± 58       3920 ± 305***        878 ± 72       3670 ± 190***

                     Mlc2v-cre-, AAV fl/fl shSf3b1#2     Mlc2v-cre+, AAV fl/fl shSf3b1#2
                     sham           TAC                  sham           TAC
n                    5              4                    5              5
BW (g)               27.4 ± 0.8     28.0 ± 0.9           26.5 ± 0.5     26.9 ± 0.3
IVS;d (mm)           0.70 ± 0.03    0.98 ± 0.03***       0.68 ± 0.02    0.84 ± 0.04
IVS;s (mm)           0.96 ± 0.06    1.37 ± 0.04**        0.97 ± 0.05    1.18 ± 0.05
LVID;d (mm)          3.94 ± 0.06    4.04 ± 0.09          3.82 ± 0.06    3.68 ± 0.06
LVID;s (mm)          2.89 ± 0.13    2.80 ± 0.09          2.71 ± 0.13    2.54 ± 0.11
LVPW;d (mm)          0.68 ± 0.02    0.98 ± 0.04***       0.70 ± 0.02    0.82 ± 0.03
LVPW;s (mm)          0.97 ± 0.06    1.37 ± 0.10**        0.99 ± 0.07    1.11 ± 0.03
%FS                  30.4 ± 2.3     19.9 ± 3.9*          30.04 ± 3.8    34.2 ± 1.3
E/A                  1.62 ± 0.09    1.72 ± 0.23          1.68 ± 0.02    1.74 ± 0.12
Aortic Vel. (mm/s)   923 ± 58       3920 ± 305***        878 ± 72       3670 ± 190***

BW, body weight; IVS;d, interventricular septum diameter in diastole; IVS;s, interventricular septum diameter in systole; LVID;d, left-ventricular inner diameter in diastole; LVID;s, left-ventricular inner diameter in
systole; LVPW;d, left-ventricular posterior wall diameter in diastole; LVPW;s, left-ventricular posterior wall diameter in systole; %FS, fractional shortening; E/A, ratio of early (E) to late (A) ventricular filling velocities;
Aortic Vel., aortic velocity. Values shown are mean ± s.e.m.; *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired t-test.
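
The %FS rows relate to the LVID rows through the usual echocardiographic definition of fractional shortening, sketched below. The formula is the standard textbook one and is an assumption here, since the table notes do not state how %FS was computed; the function name and example values are hypothetical. Group means of per-animal %FS need not equal %FS computed from the group-mean diameters, so the tabulated means are not expected to reproduce exactly.

```python
def fractional_shortening(lvid_d_mm, lvid_s_mm):
    """Fractional shortening (%) from diastolic and systolic inner diameters."""
    if lvid_d_mm <= 0:
        raise ValueError("diastolic diameter must be positive")
    return (lvid_d_mm - lvid_s_mm) / lvid_d_mm * 100.0


# Hypothetical single animal: LVID;d = 4.0 mm, LVID;s = 3.0 mm
example_fs = fractional_shortening(4.0, 3.0)
```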
Extended Data Table 2 | Echocardiographic analysis of sham- or TAC-operated Khk-A+/+/Khk-A-/- and Khk-A/C+/+/Khk-A/C-/- mice.

                     Khk-A+/+                            Khk-A-/-
                     sham           TAC                  sham           TAC
n                    6              6                    5              6
BW (g)               25.6 ± 0.7     27.3 ± 0.8           25.4 ± 0.7     25.6 ± 0.7
IVS;d (mm)           0.73 ± 0.01    0.98 ± 0.02***       0.72 ± 0.01    1.00 ± 0.03***
IVS;s (mm)           1.03 ± 0.07    1.34 ± 0.04**        1.12 ± 0.01    1.36 ± 0.06**
LVID;d (mm)          3.85 ± 0.08    3.80 ± 0.06          3.75 ± 0.10    3.73 ± 0.07
LVID;s (mm)          2.71 ± 0.16    2.65 ± 0.11          2.66 ± 0.20    2.70 ± 0.10
LVPW;d (mm)          0.72 ± 0.01    0.99 ± 0.04***       0.72 ± 0.01    0.99 ± 0.03***
LVPW;s (mm)          1.04 ± 0.03    1.35 ± 0.03***       1.05 ± 0.01    1.32 ± 0.05***
%FS                  34.0 ± 2.1     26.3 ± 1.5*          34.3 ± 1.7     26.0 ± 1.2*
E/A                  1.38 ± 0.11    2.20 ± 0.05**        1.97 ± 0.4     1.90 ± 0.4
Aortic Vel. (mm/s)   938 ± 55       4250 ± 275***        880 ± 68       3700 ± 308***

                     Khk-A/C+/+                          Khk-A/C-/-
                     sham           TAC                  sham           TAC
n                    5              6                    6              5
BW (g)               25.8 ± 0.6     24.9 ± 0.8           24.51 ± 0.6    24.4 ± 0.6
IVS;d (mm)           0.72 ± 0.01    1.00 ± 0.05**        0.64 ± 0.04    0.83 ± 0.06
IVS;s (mm)           1.15 ± 0.01    1.42 ± 0.07*         1.06 ± 0.07    1.19 ± 0.13
LVID;d (mm)          3.53 ± 0.13    3.70 ± 0.23          3.65 ± 0.13    3.67 ± 0.04
LVID;s (mm)          2.28 ± 0.27    2.61 ± 0.33          2.49 ± 0.17    2.45 ± 0.10
LVPW;d (mm)          0.69 ± 0.03    1.00 ± 0.05***       0.67 ± 0.02    0.81 ± 0.04*
LVPW;s (mm)          1.12 ± 0.03    1.25 ± 0.03*         1.00 ± 0.06    1.11 ± 0.07
%FS                  35.5 ± 3.0     27.5 ± 1.6*          32.1 ± 2.30    33.4 ± 2.9
E/A                  1.51 ± 0.09    1.56 ± 0.17          1.77 ± 0.10    1.64 ± 0.21
Aortic Vel. (mm/s)   928 ± 120      4020 ± 232***        937 ± 59       3600 ± 292***

BW, body weight; IVS;d, interventricular septum diameter in diastole; IVS;s, interventricular septum diameter in systole; LVID;d, left-ventricular inner diameter in diastole; LVID;s, left-ventricular inner diameter in
systole; LVPW;d, left-ventricular posterior wall diameter in diastole; LVPW;s, left-ventricular posterior wall diameter in systole; %FS, fractional shortening; E/A, ratio of early (E) to late (A) ventricular filling velocities;
Aortic Vel., aortic velocity. Values shown are mean ± s.e.m.; *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired t-test.
Extended Data Table 3 | Echocardiographic analysis of sham- or 1K1C-operated Khk-A+/+/Khk-A-/- and Khk-A/C+/+/Khk-A/C-/- mice.

                     Khk-A+/+                            Khk-A-/-
                     sham           1K1C                 sham           1K1C
n                    5              6                    5              6
BW (g)               27.2 ± 0.7     27.1 ± 0.6           28.2 ± 2.0     29.7 ± 0.5
IVS;d (mm)           0.75 ± 0.01    0.94 ± 0.03***       0.72 ± 0.02    1.01 ± 0.06**
IVS;s (mm)           1.08 ± 0.05    1.31 ± 0.05**        1.00 ± 0.04    1.31 ± 0.09*
LVID;d (mm)          4.02 ± 0.10    3.93 ± 0.12          3.91 ± 0.18    4.37 ± 0.32
LVID;s (mm)          3.02 ± 0.08    3.03 ± 0.14          2.90 ± 0.20    3.45 ± 0.40
LVPW;d (mm)          0.73 ± 0.02    0.99 ± 0.04***       0.74 ± 0.02    0.97 ± 0.04**
LVPW;s (mm)          1.01 ± 0.05    1.30 ± 0.05**        1.06 ± 0.05    1.29 ± 0.06*
%FS                  27.1 ± 1.5     22.6 ± 1.4*          28.8 ± 3.4     19.2 ± 3.05*
E/A                  1.97 ± 0.17    1.47 ± 0.10*         1.83 ± 0.55    1.27 ± 0.14

                     Khk-A/C+/+                          Khk-A/C-/-
                     sham           1K1C                 sham           1K1C
n                    5              6                    6              5
BW (g)               25.8 ± 0.6     24.9 ± 0.8           24.51 ± 0.6    24.4 ± 0.6
IVS;d (mm)           0.72 ± 0.01    1.00 ± 0.05**        0.64 ± 0.04    0.83 ± 0.06
IVS;s (mm)           1.15 ± 0.01    1.42 ± 0.07*         1.06 ± 0.07    1.19 ± 0.13
LVID;d (mm)          3.53 ± 0.13    3.70 ± 0.23          3.65 ± 0.13    3.67 ± 0.04
LVID;s (mm)          2.28 ± 0.27    2.61 ± 0.33          2.49 ± 0.17    2.45 ± 0.10
LVPW;d (mm)          0.69 ± 0.03    1.00 ± 0.05***       0.67 ± 0.02    0.81 ± 0.04*
LVPW;s (mm)          1.12 ± 0.03    1.25 ± 0.03*         1.00 ± 0.06    1.11 ± 0.07
%FS                  35.5 ± 3.0     27.5 ± 1.6*          32.1 ± 2.30    33.4 ± 2.9
E/A                  1.51 ± 0.09    1.56 ± 0.17          1.77 ± 0.10    1.64 ± 0.21

BW, body weight; IVS;d, interventricular septum diameter in diastole; IVS;s, interventricular septum diameter in systole; LVID;d, left-ventricular inner diameter in diastole; LVID;s, left-ventricular inner diameter in
systole; LVPW;d, left-ventricular posterior wall diameter in diastole; LVPW;s, left-ventricular posterior wall diameter in systole; %FS, fractional shortening; E/A, ratio of early (E) to late (A) ventricular filling velocities;
Aortic Vel., aortic velocity. Values shown are mean ± s.e.m.; *P < 0.05; **P < 0.01; ***P < 0.001; two-tailed unpaired t-test.
Atomic structure of the APC/C and its
mechanism of protein ubiquitination


Leifu Chang, Ziguo Zhang, Jing Yang, Stephen H. McLaughlin & David Barford


The anaphase-promoting complex (APC/C) is a multimeric RING E3 ubiquitin ligase that controls chromosome 
segregation and mitotic exit. Its regulation by coactivator subunits, phosphorylation, the mitotic checkpoint complex 
and interphase early mitotic inhibitor 1 (Emi1) ensures the correct order and timing of distinct cell-cycle transitions.
Here we use cryo-electron microscopy to determine atomic structures of APC/C-coactivator complexes with either
Emi1 or a UbcH10-ubiquitin conjugate. These structures define the architecture of all APC/C subunits, the position of
the catalytic module and explain how Emi1 mediates inhibition of the two E2s UbcH10 and Ube2S. Definition of Cdh1
interactions with the APC/C indicates how they are antagonized by Cdh1 phosphorylation. The structure of the APC/C 
with UbcH10-ubiquitin reveals insights into the initiating ubiquitination reaction. Our results provide a quantitative 
framework for the design of future experiments to investigate APC/C functions in vivo. 


The activities of the diverse proteins that orchestrate the sequential 
biochemical and morphological changes intrinsic to the cell division 
cycle are controlled through the integration of protein phosphoryla- 
tion, proteolysis and changes in gene expression. Two cullin-RING E3 
ubiquitin ligases, the APC/C and Skp1/Cul1/F-box (SCF), catalyse
the ubiquitination of multiple cell-cycle proteins to regulate their 
proteasome-mediated proteolysis. By regulating the ordered degrada- 
tion of substrates such as cyclins, securin, mitotic kinases, and micro- 
tubule motors and assembly factors, the APC/C controls sister 
chromatid segregation, cytokinesis and the initiation of chromosome 
duplication’. 

The APC/C is a large assembly comprising 19 subunits*. Its activity 
depends on the association of one of two coactivator subunits (either 
Cdc20 or Cdh1) that specify substrate recognition and stimulate the 
catalytic activity of APC/C-E2 complexes*’. Coactivators engage the 
APC/C through a conserved amino (N)-terminal C-box motif and a 
carboxy (C)-terminal Ile-Arg (IR) tail**. Cyclin-dependent kinase 
(CDK) phosphorylates the APC/C, promoting Cdc20 association*”. 
In contrast, Cdh1 is negatively regulated by phosphorylation®’°”’. 

APC/C activity is repressed by the spindle assembly checkpoint (SAC)
to ensure accurate sister chromosome segregation’. The SAC effector,
the mitotic checkpoint complex (MCC), inhibits mitotic APC/C^Cdc20
whereas Emi1 inhibits APC/C^Cdh1 during interphase’. Similar to the
MCC, Emi1 blocks destruction box (D box) recognition by
APC/C-coactivator complexes. However, Emi1 also antagonizes the two E2s,
UbcH10 and Ube2S, that pair with the APC/C and which are responsible
for catalysing ubiquitin chain initiation and elongation, respectively'’*””.

A quantitative mechanistic understanding of the APC/C requires
structural information to atomic resolution. We previously described
the human APC/C^Cdh1-substrate complex (APC/C^Cdh1.Hsl1) from a
cryo-electron microscopy (cryo-EM) reconstruction, allowing definition
of the overall architecture of the multi-subunit assembly’. Still,
the atomic structures of several large and most small APC/C subunits,
and the regulatory N-terminal domain (NTD) of Cdh1, were
unknown, which together with the flexibility of the catalytic module
comprising the RING subunit Apc11 and the C-terminal domain of
Apc2 (Apc2^CTD) limited our understanding of the complex.


Here, we used recombinant APC/C^Cdh1 in complex with Emi1
(APC/C^Cdh1.Emi1) to determine an atomic model of the APC/C to
3.6 Å resolution (Extended Data Figs 1 and 2 and Extended Data
Table 1). This structure, together with an APC/C^Cdh1-UbcH10 complex,
reveals details of the initiating ubiquitination reaction and how
Emi1 inhibits UbcH10 and Ube2S.


Atomic structure of APC/C^Cdh1.Emi1


Fig. 1a, b shows two views of the atomic model of APC/C^Cdh1.Emi1
(Supplementary Video 1). A local resolution map indicates that most
of the static regions of the complex are at ~3.2 Å resolution (Extended
Data Fig. 1). Specific regions of the molecule, including the catalytic
module and the Cdh1^WD40 domain, are more mobile and visualized at
lower resolution (Extended Data Fig. 1).

Similar to APC/C^Cdh1.Hsl1 (ref. 4), 60% of molecules are the
coactivator-bound APC/C^Cdh1.Emi1 complex, with the remainder in the apo
state (APC/C without coactivator). APC/C^Cdh1.Emi1 adopts the active
conformation with the platform sub-domain, incorporating
Apc2^CTD-Apc11, in the upward position (Fig. 1a). However, in contrast
with APC/C^Cdh1.Hsl1, where Apc2^CTD-Apc11 is flexible, Emi1
stabilizes Apc2^CTD-Apc11 by bridging it to Apc1 (Fig. 1a).

With the APC/C^Cdh1.Emi1 EM map at 3.6 Å resolution, we built a
complete atomic model of the complex. Existing crystal structures,
complemented by newly determined structures of Apc4 and the
N-terminal domain of Apc5 (unpublished data), together with homology
models (Extended Data Tables 2 and 3), were used to guide
building of most large APC/C subunits. For Apc1, fitting to the
N-terminal Apc1^WD40 domain and the densities N- and C-terminal
to its central PC domain (Apc1^PC) were performed ab initio. These
two regions coalesce to form Apc1^Mid that connects Apc1^WD40
with Apc1^PC and comprises an α-solenoid capped by a β-sandwich
(Extended Data Fig. 3a). Apc13 and Apc16 of the tetratricopeptide
repeat (TPR) lobe interact with structurally homologous symmetry-related
sites on the four TPR homo-dimers (Extended Data Fig. 3b, c).
The refined APC/C^Cdh1.Emi1 model has excellent stereochemistry and
is in good agreement with the EM density map (Extended Data Fig. 2
and Extended Data Table 1).




1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK. 


Cdh1^NTD binds to Apc1 and Apc8


In the APC/C^Cdh1.Emi1 EM map, the Cdh1^NTD-assigned region shows
clearly resolved side-chain features (Extended Data Fig. 1g). Four distinct
segments of Cdh1^NTD are organized into two mini-domains
(Fig. 2a). Segments 1 and 3 interact with Apc8B, whereas segments 2
and 4 interact with Apc1^PC. Segment 1 incorporates the seven-residue
C box (residues 46-52) (Extended Data Fig. 3d), which, through extensive
contacts with Apc8, is a key determinant of Cdh1^NTD-APC/C interactions.
The C box adopts a hairpin conformation that inserts into the
TPR superhelix of Apc8 (Fig. 2a, b). Asp46, Arg47 and Arg52 of the C
box form a network of electrostatic interactions, whereas Phe48 and
Ile49 mediate non-polar contacts. Strikingly, Arg47 and Ile49 bind a site
on Apc8 that is structurally homologous to the IR tail-binding site of
Apc3, a paralogue of Apc8 (Fig. 2b-d). In both TPR proteins, three
invariant residues (Asn339, Ser342 and Glu374 of Apc8) coordinate the
Arg guanidinium group, with the aliphatic moiety of Arg sandwiched
Figure 1 | EM reconstructions of the APC/C^Cdh1.Emi1 complex. a, b, Two views of the atomic
structure of APC/C^Cdh1.Emi1. Large subunits are
shown in cartoon, whereas the four small subunits,
Emi1, Cdh1^NTD, Cdh1^IR and Apc10^IR, are shown as
surface representations. Emi1 interacts with the
substrate-recognition and catalytic modules.


between a conserved aromatic residue and an invariant Leu (Fig. 2b-d).
The aliphatic side chain of Ile49 of the C box is flanked by two tyrosines
(both conserved in Apc3) and a Met (Leu in Apc3). The main role of
segment 3 is to stabilize the conformation of segment 1 by contributing
Leu127 and Phe128 to a cluster of hydrophobic residues at the
Apc8-Cdh1^NTD interface (Fig. 2a). Mutation of both residues severely
abrogates Cdh1 activity (Fig. 2e, lanes 1 and 4).

Segment 2 forms a leucine zipper-like α-helical interface with two
outer α-helices of Apc1^PC, an interaction mediated by three conserved
leucines (Fig. 2a). The α-helix of segment 4 aligns antiparallel to
segment 2 (Fig. 2a). It includes the KLLR sequence (Extended Data
Fig. 3d), structurally similar to the KILR motif of Cdc20^NTD that
interacts with Mad2 as a β-strand, and is implicated in mediating
APC/C-Cdc20 interactions. Leu160 and Leu161 of the KLLR
sequence, together with Val153 of segment 4, contribute to the hydrophobic
interface of segment 2 and Apc1^PC (Fig. 2a, f).
Figure 2 | The C box of Cdh1 and IR tails of Cdh1 and Apc10 interact with
structurally related sites on Apc8 and Apc3. a, Cdh1^NTD binds to Apc8
and Apc1. Segments 1 and 3 of Cdh1^NTD including the C box interact with
Apc8, whereas segments 2 and 4 interact with Apc1^PC. Segments 1-4 are
labelled as S1-4. Phosphorylation sites Ser40, Thr121, Ser151 and Ser163 are
shown as red spheres. b, Details of the C-box interactions. The Arg47 and Ile49
interaction sites are structurally related to the IR tail-binding sites for Cdh1 and
Apc10 in Apc3 shown in c and d. S. cerevisiae CDC23 (Apc8) temperature-sensitive
mutations map to the C-box binding site (Cα of mutant residues
shown as spheres in b). e, Mutation of C-box residues Arg47 and Arg52
eliminates APC/C^Cdh1 activity, as do mutations of Apc8 residues that
interact with either Arg47 or Arg52. WT, wild type. Ub_n, polyubiquitin. f, At
the Cdh1^NTD-Apc1^PC interface, multiple residues cooperate to mediate
APC/C-Cdh1 interactions. g, Ser40, Ser151 and Ser163 mediate the negative
regulation of Cdh1 by CDK phosphorylation. Experiments in Fig. 2e, f were
replicated three times and in Fig. 2g five times.
The C-terminal ten residues of Cdh1 form the IR tail. This is well
ordered and adopts a mainly α-helical conformation that inserts into
the TPR superhelix of Apc3A (Fig. 2c and Extended Data Fig. 3b). The
universally invariant C-terminal Arg496 contacts residues of Apc3A
analogous to how Arg47 of the C box contacts Apc8 (Fig. 2b, c). The
adjacent Ile of the IR tail packs into a hydrophobic pocket on Apc3.
Through its C-terminal IR residues, Apc10 interacts with a similar
conformation to the symmetry-related IR tail-binding site on the
Apc3 homo-dimer (Apc3B) (Fig. 2d).

Both C-box Arg47 and its coordinating Asn339 residue (Fig. 2b)
are required for coactivator binding to the APC/C, and the
biochemically defined interaction of Cdc20^NTD. Disruption of Apc3
residues Asn575 and Leu606 impaired coactivator binding. Furthermore,
temperature-sensitive cell-cycle mutations of Saccharomyces
cerevisiae CDC23 (Apc8) (ref. 23) can be rationalized because of their
disruption of the C-box receptor (Fig. 2b). In support of these previous
data, mutating Asn339 and Glu374 of Apc8 that engage Arg47
abolished the capacity of Cdh1 to activate the APC/C (Fig. 2e). By
contrast, mutation of the structurally related Arg493 of the IR tail only
reduced APC/C^Cdh1 activity (Extended Data Fig. 4a). Arg52 of the C
box forms a salt bridge with Glu410 of Apc8 (Fig. 2b), and disrupting
either residue eliminates APC/C activity (Fig. 2e).

To explore the basis for the negative regulation of coactivators by
CDK phosphorylation, we first identified which of the nine
potential CDK phosphorylation sites in Cdh1^NTD account for this
control. Four of these sites (Ser40, Thr121, Ser151, Ser163) (Extended
Data Fig. 3d) are both necessary and sufficient to mediate CDK
regulation of Cdh1 (Fig. 2g, lanes 1-6), in agreement with ref. 24.
Although Ser40, Ser151 and Ser163 (but not Thr121) phosphorylation
each contributes individually to a degree of Cdh1 inactivation
(Fig. 2g, lanes 7-14), simultaneous phosphorylation of all sites was
required to inactivate Cdh1 completely (Fig. 2g, lanes 1-4).
Phosphorylation of these sites would destabilize Cdh1^NTD-APC/C
interactions through electrostatic repulsion and steric clashes
(Fig. 2a). Ser40 is immediately N-terminal to the C box, whereas
the side chains of Ser151 and Ser163 flank the KLLR sequence of
segment 4 and contact Apc1^PC (Fig. 2a).


Emi1 inhibits UbcH10 and Ube2S binding

Four functional elements of Emi1 mediate APC/C^Cdh1 inhibition.
A D-box motif that occludes substrate recognition is connected
through a linker to a zinc-binding region (ZBR) (Emi1^ZBR)
that interferes with UbcH10-dependent APC/C activity. A
C-terminal LRRL sequence (LR tail: Emi1^LR), identical to the LRRL
motif required for Ube2S association with the APC/C, antagonizes
Ube2S. In the APC/C^Cdh1.Emi1 structure, we identified EM
density corresponding to all four elements (Fig. 3a, b). This agrees
with a previous EM study at lower resolution that had assigned Emi1
density bridging the substrate recognition module and platform. In
our structure, the Emi1 D box is engaged by the D-box co-receptor of
Cdh1 and Apc10 (Figs 1a and 3a). We identified EM density matching
the in-between-RING (IBR) architecture of Emi1^ZBR (ref. 13) associated
with Apc2^CTD-Apc11 (Fig. 3a, b). The domain connects these
subunits to an insertion within Apc1^PC. In the EM map, part of the
Emi1^Linker forms an α-helix that packs against the Emi1^ZBR β-sheet
and docks onto the UbcH10 binding site of Apc11^RING (Fig. 3a, b).
This blocks UbcH10 association with Apc11^RING.

We assigned Emi1^LR as a helical density feature to a site on
Apc2^CTD at the interface of its four-helix bundle and α/β sub-domain,
adjacent to Apc4^WD40 (Fig. 3b and Extended Data Fig. 2f), and in
agreement with biochemical studies. Emi1^ZBR does not contribute
to inhibiting Ube2S-catalysed ubiquitination reactions, and our
structure shows that Ube2S-specific sites on Apc11^RING (ref. 26) are
not blocked by Emi1^ZBR (Fig. 3b).

The Apc11^RING-assigned density of APC/C^Cdh1.Emi1 was a perfect
match to the crystal structure of Apc11^RING (ref. 26) (Fig. 3a, b and
Extended Data Fig. 5a). Assignment of EM density to Apc2^CTD-Apc11
indicated disorder of the Apc2^CTD WHB sub-domain. The
juxtaposition of Apc11^RING and Apc2^CTD is similar to the swung-out
conformation of Rbx1^RING in activated Cul5-Rbx1 (ref. 27)
(Extended Data Fig. 5b-e).


Apc11^RING positions UbcH10-ubiquitin

Apc11^RING stimulates ubiquitination reactions catalysed by the
initiating E2s (UbcH10 and UbcH5) and the elongating E2 (Ube2S).
Activation of UbcH10 involves a canonical interaction between its
UBC domain and Apc11^RING (ref. 26), whereas for Ube2S, Apc11^RING
is repurposed to position the acceptor ubiquitin for modification by
the Ube2S-ubiquitin conjugate. To understand substrate ubiquitination
catalysed by the APC/C-UbcH10 pair, we determined a
three-dimensional reconstruction of an APC/C^Cdh1-substrate complex
(APC/C^Cdh1.Hsl1) with a UbcH10-ubiquitin conjugate. Because of
the low affinity of APC/C-UbcH10-ubiquitin interactions, to facilitate
generation of APC/C^Cdh1.Hsl1.UbcH10-Ub complexes suitable for
cryo-EM, we fused the LR tail of Ube2S to the C terminus of UbcH10
(UbcH10^LR) (Extended Data Fig. 1c). This approach is similar to
that used to target the heterologous E2 Ube2G2 to the APC/C.
The UbcH10^LR fusion protein exhibited a 100-fold slower dissociation
rate and a tenfold higher affinity for the APC/C (Extended
Data Table 3). Competition and mutagenesis analyses indicated that
the UBC domain of UbcH10^LR binds the physiological site on the
APC/C (Extended Data Fig. 4b, c). Ubiquitination assays showed that
UbcH10^LR and UbcH10 had indistinguishable activity (Extended
Data Fig. 4d).
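The two fold-changes quoted for the fusion protein are linked through K_d = k_off / k_on. The following is a minimal sketch, assuming simple 1:1 binding (an assumption; the text reports only the fold-changes, and the function name is hypothetical). Under that assumption, a 100-fold slower dissociation combined with only a tenfold gain in affinity implies that association is roughly tenfold slower for the fusion protein.

```python
def implied_kon_ratio(koff_fold_slower: float, affinity_fold_higher: float) -> float:
    """Return k_on(fusion) / k_on(wild type) implied by K_d = k_off / k_on.

    Assumes 1:1 binding. Values below 1 mean slower association.
    """
    koff_ratio = 1.0 / koff_fold_slower      # k_off(fusion) / k_off(wild type)
    kd_ratio = 1.0 / affinity_fold_higher    # K_d(fusion) / K_d(wild type)
    # k_on = k_off / K_d, so the ratios divide in the same way.
    return koff_ratio / kd_ratio


kon_ratio = implied_kon_ratio(koff_fold_slower=100.0, affinity_fold_higher=10.0)
print(f"k_on(fusion) / k_on(wild type) = {kon_ratio:.2f}")
```

The absolute rate constants are not needed for this check, which is why none are assumed here.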

The APC/C^Cdh1.Hsl1.UbcH10-Ub complex was determined to 4.1 Å
resolution (Extended Data Fig. 2c and Extended Data Table 1b).
Similar to APC/C^Cdh1.Emi1, three-dimensional classification revealed
both ternary (coactivator-bound) and apo states (Extended Data
Fig. 6a, cycle 1). Visible EM density for the LR tail in the ternary
state indicated stoichiometric association of UbcH10-ubiquitin


Figure 3 | Interactions of Emi1 with Apc2^CTD-Apc11^RING and D-box receptor of Cdh1 and
Apc10. a, b, Views of the EM density and
underlying secondary structure. Emi1 elements are
shown: D box (Emi1); Emi1^Linker and Emi1^ZBR
interconnects Apc1^PC with Apc2^CTD and
Apc11^RING. Both Emi1^Linker and Emi1^ZBR block
the UbcH10-binding site on Apc11, but not the
Ube2S-specific site. The C-terminal LRRL
tail (Emi1^LR) of Emi1 binds to Apc2^CTD.
(Extended Data Fig. 2f). However, density corresponding to UbcH10,
connected to Apc2^CTD-Apc11^RING, was weak.

Extensive three-dimensional classification of
APC/C^Cdh1.Hsl1.UbcH10-Ub revealed structural variability of the UbcH10
density consistent with low occupancy of the UBC domain (~36%),
coupled with minor conformational variability of the
Apc2^CTD-Apc11-UbcH10 catalytic module (Extended Data Fig. 6a, cycle 2).
Subjecting a major three-dimensional class to further classification
generated a three-dimensional class with clearly resolved UbcH10
density (Extended Data Fig. 6a, cycle 3, class 5, and Extended
Data Fig. 6b, c). This structural class was suitable for docking
UbcH10 coordinates (Fig. 4a, b) and is representative of other
APC/C-UbcH10 conformational states (Extended Data Fig. 6d, e).
Relative to APC/C^Cdh1.Emi1, Apc2^CTD-Apc11^RING tilts slightly at
the Apc2^NTD-Apc2^CTD interface (Extended Data Fig. 5f), and
Apc11^RING tilts relative to Apc2^CTD (Fig. 4c). UbcH10 and
Apc11^RING interact through a canonical E2-RING mechanism,
thus docking of UbcH10-Apc11^RING was guided by crystal structures
of RING-UbcH5-ubiquitin complexes (Fig. 4a, b). Importantly,
this showed that density for ubiquitin was not visible in the EM
map, indicating that the ubiquitin moiety is flexible (Figs 4a, b
and 5a). Consistent with this conclusion, a reconstruction of an
APC/C^Cdh1.Hsl1.Apc11-UbcH10 fusion complex (Extended Data Figs 1b
and 2d and Extended Data Table 1b), and a negative-stain structure
of APC/C^Cdh1.Hsl1 with a huge excess of UbcH10, showed that
the shape of the UbcH10-assigned density resembled the
UbcH10-ubiquitin density in the APC/C^Cdh1.Hsl1.UbcH10-Ub map (Extended
Data Figs 6c and 7).

Since UbcH10 and Apc11^RING interact through a canonical
E2-RING interface (Fig. 4a, b), the APC/C probably stimulates the
intrinsic catalytic activity of UbcH10-ubiquitin by promoting a
closed conformation that primes the UbcH10-ubiquitin thioester
bond. In agreement with this, mutation of Ile36 that contacts
Apc11^RING (refs 29, 30, 34) and Ile44 that contacts UbcH10
(refs 15, 29, 30, 32, 34) (Fig. 5a) ablates APC/C-catalysed ubiquitination
(Extended Data Fig. 4e, f). Nevertheless, the absence of EM
density for the ubiquitin moiety of the UbcH10^LR-ubiquitin conjugate
indicates that APC/C does not stabilize a static closed
UbcH10-ubiquitin conformation. Modelling shows that only Apc11^RING
would interact with ubiquitin in the closed UbcH10-ubiquitin
Figure 4 | Structure of the APC/C^Cdh1.Hsl1.UbcH10-Ub complex reveals the
location of UbcH10. a, b, Views of the EM density at the catalytic centre with
UbcH10-ubiquitin. No density for the ubiquitin moiety was recovered. Arg27
and Arg77 of Apc11, required for UbcH10 interactions, are indicated. c, The
UbcH10- and Emi1-binding sites of Apc11^RING overlap. The Apc11^RING
orientation differs slightly in the two complexes.
conformation (Fig. 5a), whereas in crystal structures of
RING-UbcH5-ubiquitin conjugates where ubiquitin is static, either a second
RING subunit or a non-RING element also contacts ubiquitin
to enhance catalytic efficiency. The APC/C is reminiscent of
other single domain RING and U-box E3s that bias the E2-ubiquitin
conformation from multiple extended states to the closed state. An
interesting possibility is that substrate initiation motifs that promote
lysine ubiquitination may induce a closed UbcH10-ubiquitin
conformation.


Defining target lysine ubiquitination 

The APC/C^Cdh1.Hsl1.UbcH10-Ub model shows that the UbcH10 catalytic
Cys is some 40 Å from the C terminus of a D box at the D-box
receptor (Fig. 5a). An extended linker of ten residues would place
a lysine C-terminal to the D box at the UbcH10-ubiquitin catalytic site
(Fig. 5a). Consistent with this, peptides with a lysine at positions K10,
K13 and K18 relative to the D box were ubiquitinated, but not peptides
with lysines at K1, K4 and K7 (Fig. 5b, c). Peptides modelled on known
Figure 5 | The relative position of the catalytic and substrate-recognition
modules defines the target lysine. a, View of APC/C^Cdh1.Hsl1.UbcH10-Ub with
close-up view of the catalytic and substrate-recognition modules. A
UbcH10-ubiquitin conjugate is modelled in the closed conformation. The
Ube2S-specific site of Apc11^RING (for distal ubiquitin positioning) is indicated. The
KEN box and ABBA motif are modelled on ref. 39. The distance from the D and
KEN boxes to the catalytic centre of UbcH10 (measured as Cα-CO distance:
P10 of D box and ‘N’ of KEN box to Gly76 of ubiquitin) is indicated. A
ten-residue linker is modelled connected to P10 of the D box with Lys at K10.
Spheres denote Cα atoms of D-box P10 and linker K10. b, The reactivity of
defined peptides with progressively longer spacers separating the D box and
acceptor lysine residue (sequences shown in c). Only peptides with a lysine ten
or more residues C-terminal to the D box are substrates. *Ube2S-specific band.
The experiment was replicated three times. c, Peptide sequences used in
ubiquitination assays. D box in yellow, target lysines in blue.
APC/C substrates, geminin (Lys50, equivalent to K18 (ref. 38)) and
cyclin B (Lys64, equivalent to K13), were also ubiquitinated.
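The distance argument can be checked with rough arithmetic. The sketch below is only an upper-bound estimate under stated assumptions: a rise of about 3.5 Å per residue for a nearly fully extended chain and about 6 Å of reach for the lysine side chain are textbook approximations, not values from this study, and the 40 Å gap is the approximate figure quoted in the text. Real linkers are more compact than this bound.

```python
RISE_PER_RESIDUE = 3.5  # Å per residue; assumed value for a nearly extended chain
LYS_SIDE_CHAIN = 6.0    # Å; assumed approximate Cα-to-Nζ reach of lysine
GAP = 40.0              # Å; approximate D box-to-catalytic Cys distance from the text


def max_reach(offset: int) -> float:
    """Upper bound on how far a lysine `offset` residues past P10 can reach."""
    return offset * RISE_PER_RESIDUE + LYS_SIDE_CHAIN


# Lysine positions tested in the peptide assays (K1 ... K18).
reachable = {k: max_reach(k) >= GAP for k in (1, 4, 7, 10, 13, 18)}

for k, ok in reachable.items():
    print(f"K{k}: max reach ~{max_reach(k):.1f} Å, can span the gap: {ok}")
```

Under these assumptions only K10, K13 and K18 can span the gap, which matches the pattern of ubiquitinated peptides reported above.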


Conclusions 


The atomic structure of the APC/C provides a quantitative framework
for understanding APC/C mechanisms and the basis for future studies
to investigate mechanisms of control by phosphorylation and the
MCC. We speculate that in the APC/C^Cdc20 complex inhibited by
MCC, the second C-box receptor of the Apc8 homo-dimer may bind
the Cdc20 subunit of the MCC. TAME is an Ile-Arg dipeptide
APC/C inhibitor. Its proposed mechanism is to inhibit coactivator
association by competing for the IR tail-binding site on Apc3. The
homology of the C-box receptor of Apc8 and the IR tail-receptor of
Apc3 suggests that TAME may also disrupt C-box-APC/C interactions,
a notion consistent with the more important role of the C box
for APC/C activity and cell viability.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. 

Cloning, expression and purification of recombinant human APC/C. The
genes for recombinant human APC/C^Cdh1.Emi1 were cloned into a modified
MultiBac system as described previously. Two pU2 plasmids were prepared:
pU2-1 harbouring genes for Apc1 and Apc11, and pU2-2 containing genes for
Apc5, Apc8, Apc10, Apc13 and Apc15. Two pF2 plasmids were prepared: pF2-1
harbouring genes for Apc3, Apc6, Apc7, Apc12 and Apc16, and pF2-2 containing
genes for Apc2, Apc4, Cdh1 and Emi1. A C-terminal TEV-cleavable double Strep-II
tag was added to Apc4 for purification of the complex. Two bacmids were
prepared from these four plasmids, resulting in two viruses harbouring all 16 genes
with each gene flanked by its own promoter and terminator. The APC/C^Cdh1.Emi1
ternary complex was co-expressed from these two viruses in High 5 insect cells.
The complex was purified by a combination of Strep-Tactin affinity (Qiagen),
Mono Q anion-exchange chromatography and Superose 6 size-exclusion
chromatography (GE Healthcare) similar to ref. 42.
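The plasmid layout can be written down as a small data structure to confirm the gene count. This is a bookkeeping sketch only: the dictionary name is hypothetical, and how the four plasmids were paired into the two bacmids is not stated in the text, so it is not modelled here.

```python
# Gene content of each transfer plasmid, as listed in the text.
PLASMIDS = {
    "pU2-1": ["Apc1", "Apc11"],
    "pU2-2": ["Apc5", "Apc8", "Apc10", "Apc13", "Apc15"],
    "pF2-1": ["Apc3", "Apc6", "Apc7", "Apc12", "Apc16"],
    "pF2-2": ["Apc2", "Apc4", "Cdh1", "Emi1"],
}

all_genes = [gene for genes in PLASMIDS.values() for gene in genes]

# Each gene should appear on exactly one plasmid.
duplicates = {gene for gene in all_genes if all_genes.count(gene) > 1}

print(f"{len(all_genes)} genes across {len(PLASMIDS)} plasmids; duplicates: {sorted(duplicates)}")
```

The total of 16 agrees with the number of genes stated for the two viruses.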

APC/C^Cdh1.Hsl1.Apc11-UbcH10 was cloned, expressed and purified as described
previously, except that Apc11 was replaced with Apc11 fused to UbcH10^C114K
at its C terminus by a 30-residue linker of GSA repeats. A similar version of
the APC/C^Cdh1.Hsl1.Apc11-UbcH10 fusion complex was also prepared using wild-type
UbcH10 instead of UbcH10^C114K for activity studies.

UbcH10^LR was constructed by fusion of residues 154-222 of Ube2S to the C
terminus of UbcH10^C114K in the pET28 plasmid. An N-terminal His-TEV site
was added to UbcH10^LR. In the purified ubiquitin and UbcH10^LR proteins the
His6 tag was retained on ubiquitin, but cleaved from UbcH10^LR. The
UbcH10^LR-ubiquitin conjugate was generated as previously described with minor
modifications. UbcH10^LR (200 µM) was incubated with His6-tagged ubiquitin (400 µM) at
25 °C for 16 h in a buffer containing 5 µM UBA1, 3 mM ATP, 5 mM MgCl2,
0.8 mM TCEP, 150 mM NaCl, 50 mM Tris-HCl (pH 10.0). Additional 2 mM
ATP and 2 µM UBA1 were added to the reaction and incubated at 25 °C for
another 8 h. The UbcH10^LR-ubiquitin conjugate was separated from unconjugated
UbcH10^LR by Ni-NTA and separated from free ubiquitin by S75
size-exclusion chromatography (GE Healthcare). APC/C^Cdh1.Hsl1 was purified as
described previously. The purified APC/C^Cdh1.Hsl1 was concentrated to 3 µM.
APC/C^Cdh1.Hsl1.UbcH10-Ub was reconstituted by incubating APC/C^Cdh1.Hsl1 with
UbcH10^LR-ubiquitin conjugate in a 1:3 molar ratio on ice for 30 min in buffer of
20 mM HEPES (pH 8.0), 150 mM NaCl, 1 mM DTT. Just before applying to EM
grids, the sample was cross-linked with 0.03% glutaraldehyde, and incubated on
ice for 40 min. The reaction was quenched by adding 40 mM Tris-HCl (pH 8.0).
The quenched sample was incubated on ice for 30 min before further purification
with size-exclusion Superose 6. A similar UbcH10^LR protein was made by using
wild-type UbcH10 for use in activity tests. Surface plasmon resonance experiments
for UbcH10^LR and APC/C were performed as described previously.
The Y91A and S123G mutants of UbcH10 and UbcH10^LR were prepared as
for wild-type UbcH10.

For ubiquitination reactions, APC/C and its variants with mutant subunits
(Apc1 and Apc8) were cloned, expressed and purified as for apo APC/C
by replacing the target subunit with its mutant. These included APC/C with
Apel it97A/Li227A230A/812314 ApCiC with Apc8'23°4/E3744, APC/C with 
Apc8X?3°4/83748 and APC/C with Apc8@)°%. 

Human Cdh1 was cloned into pU1 with an N-terminal His, followed by a 
haemagglutinin tag. On the basis of this plasmid, variants of Cdh1 mutations 


were made. These were Cdh1A/L614- Cdh1¥914/1994, Cdh1 TART AIP IZOD 
Cdh 1944/1 994/V 1534 Cdh 1b OA/LI1AYSIAILIOA, Cdh1 LIGOA/LIG1A/L127A/F128A. 


> 
Cc dh 1 2} LS9A/LI27A/F 128A C dh ]E1OOALAGLA/YSLAILO9A/LI27A/F1284 C dh 12474, C ah R24. 
3. 
Cdhi 84934, Cdh1™ 12LAISISIA/S163A, Cdhh1S404/S151A/81634, Cdh18404/T1214/5163 A 
Cdh1S404/T 21/8151 Ch S#0NTIZIAISISLA/SI63A and Cdh1 T32A/S36A/S70A/S138A/S146A. 


Cdh1 and all its mutants were expressed in High Five insect cells. To purify
Cdh1, the cells were lysed in a buffer of 50 mM Tris-HCl (pH 7.3), 500 mM NaCl, 
20 mM imidazole (Buffer A), with a protease inhibitor cocktail. After loading, the 
Ni-NTA column was washed with ten column volumes of buffer A, followed by a 
gradient wash to 200 mM imidazole in buffer A and further washing with ten 
column volumes of 200 mM imidazole in buffer A. The protein was eluted in 
buffer B of 300 mM imidazole (pH 7.3), 500 mM NaCl. Cdh1 was finally purified 
using S-200 size exclusion chromatography in 20 mM HEPES (pH 7.0), 200 mM 
NaCl, 1 mM DTT. 

Human ubiquitin complementary DNA was cloned into a pET28 plasmid with
an N-terminal His6 tag and TEV cleavage site. On the basis of this plasmid,
the ubiquitin Ile36 and Ile44 mutants were prepared. Wild-type and mutant
ubiquitin were expressed in the B834 strain and purified by Ni-NTA and S75
size-exclusion chromatography (GE Healthcare).
APC/C ubiquitination assays. Human APC/C and its different mutant derivatives,
UBA1, UbcH10 and its mutants, Ube2S, ubiquitin and its mutants, and
yeast Hsl1^667-872 were buffer exchanged into 20 mM HEPES (pH 8.0), 150 mM
NaCl, 1 mM DTT. Human Cdh1 and its mutants were in 20 mM HEPES (pH 7.0),
200 mM NaCl, 1 mM DTT. Ubiquitination assays were performed as described.
For the UbcH10 competition assays, UbcH10^C114K and UbcH10^C114K-LR at the
concentrations 0.001, 0.01, 0.1, 1 and 10 µM were used to compete with UbcH10
(0.1 µM). Cdh1 was at 20 nM. To test the UbcH10-RING domain interface, the
following UbcH10 and UbcH10^LR mutants were prepared and tested: Y91A and
S123G. Single time point assays were at 10 min.

Cdh1 phosphorylation assays. CDK2-dependent phosphorylation of Cdh1 and 
its mutants was performed in buffer containing 50 mM HEPES (pH 8.0), 10 mM 
MgCl2, 2 mM ATP, 10 mM NaF, 10 mM glycerol phosphate and 150 mM NaCl.
The reactions were incubated at 23 °C for 30 min with 500 nM of Cdh1 and 50 nM 
of active phosphorylated CDK2-cyclin A3. The phosphorylation was terminated 
using the CDK1/2 inhibitor (Calbiochem). Non-CDK2-treated Cdh1 samples 
were set up in parallel with phosphorylating reactions by adding buffer instead 
of CDK2-cyclin A3. The activity assays were performed as described for ubiqui- 
tination assays. 

Peptide assays. Peptides for D-box lysine distance analysis were synthesized by
Designer BioScience. The peptides were modified by N-terminal biotinylation.
Peptides used were the following: Pep-1K (EQRPRTALGDISNSKASGS); Pep-4K
(EQRPRTALGDISNSAGSKASGS); Pep-7K (EQRPRTALGDISNSAGSAGSKASGS);
Pep-10K (EQRPRTALGDISNSAGSAGSAGSKASGS);
Pep-13K (EQRPRTALGDISNSAGSAGSAGSAGSKASGS);
Pep-18K (EQRPRTALGDISNSAGSAGSAGSAGSAGSGSKASGS);
GEM-K50 (EQRPRTALGDISNSASGSLVGRENELSAGLSKRRHR); and
CycB-K64 (GLRPRTALGDIGNRVSEQLQARMPMRKEARPSATGS).
The peptides were dissolved at a concentration of 6.25 mM in 100%
dimethylsulfoxide (DMSO). The ubiquitination assays were performed as
described above except that 125 µM of peptide instead of Hsl1 was used as
substrate. Previous work showed that at 100 µM, D-box-only peptides bind the
D-box receptor of APC/C^Cdh1 (refs 43, 44). The reactions were separated in
4-12% NuPAGE Bis-Tris gels (Life Technology) at 60 V for 2 h, followed by
150 V for 30 min for better resolution of peptides and ubiquitination products.
After blotting, the membranes were blocked in 10% non-fat dry milk in PBS
overnight at 4 °C. After washing the blot four times with PBS-2% Tween 20
buffer, ubiquitinated peptide products were visualized by western blotting with
PBS-buffered streptavidin-HRP (Sigma) with 1% bovine serum albumin.
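The peptide names encode the position of the acceptor lysine relative to the D box, and this can be recovered directly from the sequences. The sketch below assumes the minimal D-box consensus RxxLxxxxN for positions P1 to P9, takes the next residue as P10, and counts the lysine position from P10 (so K1 is the residue immediately after P10). The regular expression and function name are illustrative choices, not part of the published methods.

```python
import re

# Peptide sequences as listed in the text.
PEPTIDES = {
    "Pep-1K": "EQRPRTALGDISNSKASGS",
    "Pep-4K": "EQRPRTALGDISNSAGSKASGS",
    "Pep-7K": "EQRPRTALGDISNSAGSAGSKASGS",
    "Pep-10K": "EQRPRTALGDISNSAGSAGSAGSKASGS",
    "Pep-13K": "EQRPRTALGDISNSAGSAGSAGSAGSKASGS",
    "Pep-18K": "EQRPRTALGDISNSAGSAGSAGSAGSAGSGSKASGS",
    "GEM-K50": "EQRPRTALGDISNSASGSLVGRENELSAGLSKRRHR",
    "CycB-K64": "GLRPRTALGDIGNRVSEQLQARMPMRKEARPSATGS",
}

# Assumed minimal D-box consensus RxxLxxxxN covering positions P1-P9.
D_BOX = re.compile(r"R..L....N")


def lysine_offset(sequence: str):
    """Return the position of the first lysine after P10, counted from P10."""
    match = D_BOX.search(sequence)
    if match is None:
        return None
    p10_index = match.end()  # residue immediately after P9 (the Asn)
    lys_index = sequence.find("K", p10_index + 1)
    return None if lys_index == -1 else lys_index - p10_index


offsets = {name: lysine_offset(seq) for name, seq in PEPTIDES.items()}
for name, offset in offsets.items():
    print(f"{name}: K{offset}")
```

For the geminin and cyclin B peptides this numbering gives K18 and K13, in line with the equivalences stated in the main text.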
Electron microscopy. Freshly purified recombinant human APC/C complexes
(APC/C^Cdh1.Emi1, APC/C^Cdh1.Hsl1.UbcH10-Ub and APC/C^Cdh1.Hsl1.Apc11-UbcH10)
were first visualized by negative-stain electron microscopy to check homogeneity.
Complexes were then diluted to ~200 µg ml^-1 and aliquots of 2 µl were applied to
Quantifoil R2/2 grids coated with a second layer of home-made thin carbon film
(estimated to be ~50 Å thick), treated with a 9:1 argon:oxygen plasma for 20 s
before use. The grids were incubated for 30 s at 4 °C and 100% humidity and
then blotted for 6 s and plunged into liquid ethane using an FEI Vitrobot III. The
grids were loaded into an FEI Tecnai Polara electron microscope at an accelerating
voltage of 300 kV. Images were taken manually using low-dose mode at a
nominal magnification of ×78,000 (yielding a pixel size of 1.36 Å per pixel) for
APC/C^Cdh1.Emi1 and APC/C^Cdh1.Hsl1.Apc11-UbcH10, and ×59,000 (yielding a pixel
size of 1.77 Å per pixel) for APC/C^Cdh1.Hsl1.UbcH10-Ub. Images were recorded using
an FEI Falcon II direct electron detector with a defocus range of ~2.5-4 µm. Each
image was exposed for 2 s at dose rate of 27 electrons per square ångström per
second at a nominal magnification of ×78,000 and 16 electrons per square
ångström per second at a nominal magnification of ×59,000. Thirty-four movie
frames were recorded for each image as previously described.
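The exposure settings translate into total and per-frame doses by simple multiplication. The sketch below assumes the dose is spread evenly over the 34 movie frames, which is an assumption made here for illustration and is not stated in the text.

```python
EXPOSURE_S = 2.0  # exposure time per image, in seconds
N_FRAMES = 34     # movie frames recorded per image

# Dose rate in electrons per square ångström per second, keyed by nominal magnification.
DOSE_RATES = {78_000: 27.0, 59_000: 16.0}

doses = {}
for magnification, rate in DOSE_RATES.items():
    total = rate * EXPOSURE_S
    doses[magnification] = {
        "total": total,                  # electrons per square ångström
        "per_frame": total / N_FRAMES,   # assumes equal dose in every frame
    }

for magnification, dose in doses.items():
    print(f"x{magnification}: total {dose['total']:.0f} e/Å², "
          f"about {dose['per_frame']:.2f} e/Å² per frame")
```

Under this even-dose assumption the ×78,000 data accumulate 54 electrons per square ångström per image and the ×59,000 data accumulate 32.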

To prepare a negative-stain reconstruction of APC/C^Cdh1.Hsl1-UbcH10
(with excess wild-type UbcH10), we incubated freshly purified APC/C^Cdh1.Hsl1
(0.04 µM) with a 1,500-fold excess of UbcH10 (60 µM, 200-fold in excess of
dissociation constant Kd) for 1 h on ice. Images were collected on an FEI Spirit
electron microscope at an accelerating voltage of 120 kV at -1.5 µm defocus.
The map was reconstructed using RELION without contrast transfer function
correction.
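The concentrations quoted for the negative-stain sample are internally consistent, and they imply near-saturating conditions. The sketch below assumes 1:1 binding with UbcH10 in such excess that its free concentration is approximately its total concentration; the implied Kd is back-calculated from the stated 200-fold excess and is not a value quoted in the text.

```python
apc_c_um = 0.04               # APC/C concentration, µM
fold_excess_over_apc_c = 1500
fold_excess_over_kd = 200

ubch10_um = apc_c_um * fold_excess_over_apc_c   # total UbcH10, µM
implied_kd_um = ubch10_um / fold_excess_over_kd  # back-calculated Kd, µM

# Fraction of APC/C with UbcH10 bound for 1:1 binding, ligand in large excess.
occupancy = ubch10_um / (ubch10_um + implied_kd_um)

print(f"UbcH10: {ubch10_um:.0f} µM, implied Kd: {implied_kd_um:.2f} µM, "
      f"expected occupancy: {occupancy:.1%}")
```

An expected occupancy above 99% under these assumptions is what makes this sample a useful comparison for the shape of the UbcH10-assigned density.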

Image processing. All movie frames were first aligned by the motioncorr program
before subsequent processing. Contrast transfer function parameters
were calculated with CTFFIND3 (ref. 47). Particles in 264 pixels × 264 pixels
(or 200 pixels × 200 pixels for data collected at ×59,000 magnification) were
initially selected automatically by e2boxer.py in EMAN2 (ref. 48) from a
small portion of the same data set (typically ~300 micrographs). After
two-dimensional classification using RELION, ten images in different views were
selected from the two-dimensional averages and used as reference for automatic
particle picking for the whole data set by RELION 1.3 (ref. 50). The following
procedures were performed in RELION to exclude bad particles from the final
reconstruction. First, particles in each micrograph were displayed and manually
screened to delete wrongly picked ice contamination. Second, extracted particles
were sorted by similarity to reference images, and particles with low z-scores were
deleted (roughly 10% of particles were deleted in this step). Third, two-dimensional
classification was performed and particles in bad classes (with poor structural
features) were removed (roughly 10-20% of particles were deleted in this step).
The resultant particles were then used for three-dimensional classifications.
Conformational heterogeneity and remaining bad particles were excluded from
the final reconstruction in this step. A summary of particles used for final
refinements is listed in Extended Data Table 1c.
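The two particle box sizes correspond to nearly the same physical extent once the pixel sizes at each magnification are taken into account. A minimal check, using only the box and pixel sizes quoted in the text (the variable names are illustrative):

```python
# (box edge in pixels, pixel size in Å per pixel), keyed by nominal magnification.
DATASETS = {
    78_000: (264, 1.36),
    59_000: (200, 1.77),
}

box_edge_angstrom = {
    magnification: n_pixels * pixel_size
    for magnification, (n_pixels, pixel_size) in DATASETS.items()
}

for magnification, edge in box_edge_angstrom.items():
    print(f"x{magnification}: box edge ~{edge:.0f} Å")
```

Both boxes come out at roughly 355 to 360 Å per edge, so particles from the two data sets were extracted with a comparable margin around the complex.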

Beam-induced particle motion was corrected using statistical movie processing
in RELION as previously described. For APC/C, a running average of nine movie
frames for frames within the first 32-electron dose and a standard deviation of one
pixel for the translational alignment were used. This step improved the resolution
from 4.2 Å to 3.9 Å. Following this, particle polishing in RELION 1.3 was used to
improve the resolution further. This program fits all movie frames as linear
tracks, considering movements of neighbouring particles to improve signal for
smaller particles. Furthermore, each frame is weighted by a B-factor (estimated
from each frame reconstruction) to account for radiation damage. Particle polishing
improved the resolution from 3.9 Å to 3.6 Å. All resolution
estimations were from gold-standard Fourier shell correlation (FSC) calculations
to avoid over-fitting, and reported resolutions are based on the FSC = 0.143
criterion. Final FSC curves were calculated from the two independent
reconstructions after applying a soft spherical mask (with a five-pixel fall-off).
To visualize high-resolution details, all density maps were corrected for the
modulation transfer function of the detector and sharpened by applying negative
B-factors, estimated using automated procedures.
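The FSC = 0.143 criterion described above can be illustrated with a minimal NumPy sketch. This is not the RELION implementation: the function names `fsc_curve` and `resolution_at_threshold` are hypothetical, the two half maps are assumed to be cubic arrays of equal size that have already been masked, and the 1.36 Å pixel size is taken from Extended Data Table 1.

```python
import numpy as np

def fsc_curve(half1, half2):
    """Return (shell index, FSC) for two equally sized cubic half maps."""
    assert half1.shape == half2.shape and len(set(half1.shape)) == 1
    n = half1.shape[0]
    f1 = np.fft.fftshift(np.fft.fftn(half1))
    f2 = np.fft.fftshift(np.fft.fftn(half2))
    # Integer shell radius of every voxel, measured from the Fourier origin.
    grid = np.indices(half1.shape) - n // 2
    radius = np.rint(np.sqrt((grid ** 2).sum(axis=0))).astype(int)
    n_shells = n // 2
    inside = radius < n_shells
    r = radius[inside]
    # Per-shell sums of the cross term and of the two power spectra.
    cross = np.bincount(r, weights=(f1 * np.conj(f2)).real[inside], minlength=n_shells)
    power1 = np.bincount(r, weights=(np.abs(f1) ** 2)[inside], minlength=n_shells)
    power2 = np.bincount(r, weights=(np.abs(f2) ** 2)[inside], minlength=n_shells)
    return np.arange(n_shells), cross / np.sqrt(power1 * power2)

def resolution_at_threshold(fsc, box_size, pixel_size, threshold=0.143):
    """Resolution (in Å) of the first shell whose FSC falls below the threshold."""
    below = np.nonzero(fsc[1:] < threshold)[0]
    if below.size == 0:
        return 2.0 * pixel_size  # never crosses: report the Nyquist limit
    shell = below[0] + 1
    return box_size * pixel_size / shell

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    signal = rng.normal(size=(32, 32, 32))
    # Two noisy copies of the same signal stand in for independent half maps.
    half_a = signal + rng.normal(size=signal.shape)
    half_b = signal + rng.normal(size=signal.shape)
    shells, fsc = fsc_curve(half_a, half_b)
    print(resolution_at_threshold(fsc, box_size=32, pixel_size=1.36))
```

Shell `k` corresponds to a spatial frequency of `k / (box_size * pixel_size)`, so the reported resolution is simply the reciprocal of the frequency at which the curve first drops below 0.143.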

Motion correction was performed after three-dimensional classification for
APC/C^Cdh1.Emi1, which showed ~60% ternary complex and ~40% apo state
APC/C. For APC/C^Cdh1.Hsl1.UbcH10-Ub and APC/C^Cdh1.Hsl1.Apc11-UbcH10, all
particles were refined against one model and motion corrected as described above.
The polished particles were used for three-dimensional classification as shown in 
Extended Data Figs 6 and 7. The first cycle of classification was done by 
global search and a sampling angular interval of 7.5° to sort out large conforma- 
tional changes; cycles 2-4 were executed by local search within 15° and a smaller 
sampling angular interval of 3.7°. Three-dimensional classifications were run for 
25 iterations with class number set at 10 and regularization parameter T set at 4. 

To further improve the density map of flexible parts (RING domain and
UbcH10) in the complexes, a soft mask was used for local alignment during
refinement. A soft mask was generated according to combined coordinates of
Apc1/Apc2/Apc4/Apc5/Apc11/UbcH10 (approximately half of the mass) and a
restrained angular search of 3° was used. Local resolution was estimated using
ResMap.
Model building of APC/C. Modelling of the structures of APC/C was performed
in COOT. Initially, available crystal structures and homology models as
described in our previous work were fitted into the cryo-EM map in Chimera.
These structures included (1) crystal structures of human Apc10 (residues 2-162,
Protein Data Bank (PDB) accession number 1JHJ), (2) N-terminal seven
helices of Apc7 (residues 21-166, PDB accession number 3FFL), (3)
Schizosaccharomyces pombe Cut9-Hcn1 complex (Apc6-Apc12 homologue,
PDB accession number 2XPI), (4) N-terminal dimerization domains of
Encephalitozoon cuniculi Cdc27 (Apc3 homologue, PDB accession number
3KAE), (5) S. pombe Cut23 (Apc8 homologue, PDB accession number
3ZN3), (6) Cdh1-D-KEN (PDB accession number 4BH6), (7) Apc5 TPR
domain and C-terminal TPR helices of Apc3, Apc8 and Apc7 (modelled using the
crystal structure of S. pombe Cut9 as template), (8) crystal structures of Apc4 and
Apc5^NTD (residues 1-160) (unpublished data) and (9) NMR structure of Emi1
ZBR domain (PDB accession number 2M6N) and Apc11 RING domain (PDB
accession number 4R2Y). For Apc2-Apc11, the structure was modelled using a
crystal structure of Cul1 (PDB accession number 1U6G) as a template. For Apc1
PC repeats, the structure was predicted with I-TASSER using the PC domain
structure of Rpn2 (PDB accession number 4ADY) as a reference. All fitted
structures were rebuilt according to the cryo-EM map. The majority of Apc1,
small subunits (Apc13, Apc15, Apc16), IR C-terminal tails of Apc10 and Cdh1,
N-terminal domain of Cdh1 (Cdh1^NTD) and some loops not seen in crystal
structures and homology models were built ab initio. Details are given in Extended
Data Table 2.

The model was refined by REFMAC version 5.8 (ref. 64). A REFMAC weight of
0.04 was defined by cross-validation using half reconstructions. The resolution
limit for refinement was set at 3.5 Å. All high-resolution crystal structures or
NMR structures were used for secondary structure restraints. A summary of
model refinement is listed in Extended Data Table 1a.

Rigid-body fitting of UbcH10 was performed with the 'fit in map' program in
Chimera. The crystal structure of the Rnf4-UbcH5a-ubiquitin heterotrimeric
complex (PDB accession number 4AP4) was used to model ubiquitin, which was not
seen directly in the reconstruction owing to flexibility. A summary of model
building is listed in Extended Data Table 2.

All regions of the APC/C EM density map were assigned to APC/C subunits.
However, the intrinsically disordered regions of some APC/C subunits were not
visible in the EM map (defined in Extended Data Table 2). Altogether, 78% of
APC/C residues were fitted to the EM map. Inter-subunit contact areas were
calculated using CCP4 (ref. 67).

Map visualization. Figures and movies were generated using Pymol and
Chimera.

Structural conservation. Structural conservation was calculated using
ConSurf on the basis of a multiple sequence alignment of APC/C subunit
sequences from Homo sapiens, Mus musculus, Xenopus laevis, Caenorhabditis
elegans, Drosophila melanogaster, Arabidopsis thaliana, S. cerevisiae and S. pombe.

Protein similarity searches. Protein similarity searches were performed using
DALI, and protein alignment figures were created using ALSCRIPT.
Extended Data Figure 1 | Preparations and EM images of APC/C complexes.
a, Coomassie-blue-stained SDS gel of APC/C^Cdh1.Emi1. b, Coomassie-blue-
stained SDS gel of APC/C^Cdh1.Hsl1.Apc11-UbcH10. c, Coomassie-blue-stained SDS
gel and western blot analysis (anti-His antibody; only ubiquitin in the complex
contains a His tag) of APC/C^Cdh1.Hsl1.UbcH10-Ub without or with cross-linking
by glutaraldehyde. d, A typical cryo-EM micrograph of APC/C^Cdh1.Emi1;
representative of 3,328 micrographs. e, Gallery of two-dimensional averages
of APC/C^Cdh1.Emi1 showing different views; representative of 100 two-
dimensional averages. f, Local resolution map of APC/C^Cdh1.Emi1 showing
resolution range. g, Details of EM density for segments of α-helix and β-strand
of Apc1 and the C box of Cdh1.
Extended Data Figure 2 | Resolution estimation and example of de novo
model building. a, Gold-standard FSC curve and FSC curve between cryo-EM
map and final atomic model of the APC/C^Cdh1.Emi1. b, Cross-validation of
model refinement by half maps. Shown are FSC curves between the atomic
model and the half map it was refined against, and FSC curves
between the atomic model and the other half map that was not used
during refinement. c, Gold-standard FSC curve of APC/C^Cdh1.Hsl1.UbcH10-Ub.
d, Gold-standard FSC curves of human APC/C^Cdh1.Hsl1.Apc11-UbcH10. e, De novo
model building of Apc15 N-terminal loop. The surrounding Apc5 residues are
also shown. f, EM density for the LR tail common to Emi1 and Ube2S is
only observed in APC/C complexes with subunits incorporating the LR tail.
(1) APC/C^Cdh1.Emi1. (2) An APC/C complex with an LR tail-bearing subunit
(UbcH10^LR of APC/C^Cdh1.Hsl1.UbcH10-Ub). (3) No LR tail density in the
APC/C^Cdh1.Hsl1.Apc11-UbcH10 fusion and (4) APC/C^Cdh1.Hsl1 (EMDB: 2651; ref. 4).
Extended Data Figure 3 | Apc1 structure and the TPR lobe interacts with
multiple subunits. a, Cartoon of Apc1 colour-ramped from blue to red for
the N to C termini. Insertions that interact with Apc2, Apc8 and Apc10 are
labelled. Apc1^Mid adopts a novel architecture. b, TPR lobe with TPR subunits
shown as surface representations, with the small TPR-accessory subunits
(Apc12, Apc13, Apc15 and Apc16), a segment of Apc5, IR tails of Cdh1 and
Apc10, and the Cdh1 C box that interact with the TPR lobe shown as
cartoons. The N termini of Apc12, Apc13 and Apc15 are buried. The eight
structurally homologous and symmetry-equivalent sites on the TPR lobe that
bind Apc13, Apc16 and Apc5 are indicated and shown in detail in c. The view is
similar to Fig. 1a. c, The eight TPR subunits interact with Apc13, Apc16 and
Apc5 mainly through contacts to four conserved aromatic residues present on
most TPR subunits (Y308, Y309, W302, Y322 of Apc6 (panels 5 and 6)).
d, Sequence of the ordered region of Cdh1^NTD bound to Apc1 and Apc8
(ordered regions shown as lines and α-helices). Critical Apc1 and Apc8 contact
residues are indicated with green and blue arrows. Phosphorylation sites are
indicated with red arrows.
Extended Data Figure 4 | APC/C ubiquitination assays. a, Mutation of
Arg493 of the IR tail reduces APC/C^Cdh1 activity. b, Mutations at the
RING domain interface of UbcH10 and UbcH10^LR disrupt ubiquitination
activity. c, Ubiquitination assay shows that both UbcH10(C114K) and
UbcH10^LR(C114K) compete with wild-type UbcH10. UbcH10^LR(C114K) is a
more potent inhibitor. d, The APC/C-UbcH10-mediated substrate
ubiquitination activities of UbcH10 and UbcH10^LR are indistinguishable.
e, The ubiquitin (I36A) and ubiquitin (I44A) mutants were defective for
APC/C-UbcH10-mediated substrate ubiquitination. Experiments in Extended
Data Fig. 4a-e were replicated three times. f, UbcH10 charging by the ubiquitin
(I36A) and ubiquitin (I44A) mutants was unchanged relative to wild-type
ubiquitin.
Extended Data Figure 5 | The position of Apc11^RING in the APC/C is more
similar to Rbx1^RING of activated cullin-Rbx1 structures. a, Identification of
Apc11 in apo APC/C. Left panel: EM density map for apo APC/C with the
coordinates of Apc2^CTD-Apc11 fitted (from the APC/C^Cdh1.Emi1 structure). Right
panel: EM density for APC/C lacking Apc11^RING. The difference density corresponds
to Apc11^RING. EM density maps from ref. 4. b, Superimposition of Apc2^CTD onto
Cul1^CTD (PDB accession number 1LDK). c, Superimposition of Apc2^CTD onto
Cul5^CTD (PDB accession number 3DQV). In the inactive conformation of
Cul1-Rbx1, Rbx1^RING packs against WHB. In APC/C^Cdh1.Emi1,
Apc11^RING remains in contact with Apc2^CTD but has rotated ~180° relative to
inactive CRL structures, being similar to the swung-out conformation of
Rbx1^RING of neddylated and activated Cul5-Rbx1 (ref. 27). d, e, The relative
orientation of Apc2^NTD and Apc2^CTD is also dramatically different from Cul1
(ref. 61). This is due to a 70° rotation within cullin repeat 3 (between helices
A-B and C-D-E), and a ~20° rotation around the 4HB-cullin repeat 3
interface. Similar, less pronounced structural variations are observed within
the CRL family. d, Apc2-Apc11 (this study). e, Cul1-Rbx1 (PDB accession
number 1LDK). f, The position of the Apc2^CTD-Apc11 module differs
slightly about the Apc2^NTD-Apc2^CTD interface between APC/C^Cdh1.Emi1
and APC/C^Cdh1.Hsl1.UbcH10-Ub.
Extended Data Figure 6 | Three-dimensional classification of
APC/C^Cdh1.Hsl1.UbcH10-Ub. a, The three-dimensional classification (cycle 1)
started with 477,850 motion-corrected particles, which were divided into ten
classes. The resultant classes were grouped into five categories: (1) 58.8% in the 
active ternary state with coactivator and substrate (Hsl1); (2) 11.9% in the apo- 
inactive state; (3) 9.4% in a class with Cdh1 bound but with the catalytic module 
in the apo-inactive conformation (hybrid state); (4) 16.5% with weak Apc2 
density; and (5) 3.4% were a poor reconstruction due to bad particles. 
Examination of the hybrid state (3) showed that density for Apcl”"° was 
absent, explaining the lack of Cdh1-induced conformational change of the 
catalytic module. Particles in the ternary state (reconstructed to an overall 
resolution of 4.1 Å) were subjected to further three-dimensional classification
(cycle 2). A major class (class 7, 26.4% of particles) showed improved density
for UbcH10 (circled), and cycle 3 classification was performed on particles in
this class. The major class of cycle 3 (class 5, 25.9% of particles) showed further
improved UbcH10 density. Further three-dimensional classification of this
class did not improve the UbcH10 density. b, Particles in the best class (cycle 3,
class 5, 19,939 particles) were refined in RELION and resulted in a map at 5.7 Å
resolution (Extended Data Fig. 2c). The UbcH10 density was improved by
local alignment using a soft mask (indicated by circles) as described in
Methods. c, Enlarged view of UbcH10 density. d, Enlarged view of the averaged
APC/C^Cdh1.Hsl1.UbcH10-Ub reconstruction from cycle 1 of the three-dimensional
classification (59% of particles). UbcH10 density is circled. e, Superimposition
of classes 6, 7 and 8 of cycle 2 of the three-dimensional classification (from 
a), showing the structural variability of the three three-dimensional classes 
that indicate UbcH10 density. UbcH10 density is circled. 
Extended Data Figure 7 | Three-dimensional classification of
APC/C^Cdh1.Hsl1.Apc11-UbcH10. a, The three-dimensional classification started
with 97,999 motion-corrected particles, which were divided into five classes.
The resultant classes were grouped into three categories: (1) 80.6% in the active
ternary state with coactivator and substrate (Hsl1); (2) 9.3% in the apo state;
and (3) 10.1% in a hybrid state. Particles in the active ternary state
(reconstructed to an overall resolution of 4.3 Å) were subjected to cycle 2
classification with ten classes. UbcH10 density in the resultant classes is
indicated with circles. b, Classes with UbcH10 density in cycle 2 classification
are superimposed, showing variability of UbcH10. c, Enlarged view of
UbcH10 density. d, A negative-stain EM reconstruction of an APC/C^Cdh1.Hsl1
complex at ~25 Å resolution with a 1,500-fold excess of UbcH10. The
molecular surface is shown as a mesh representation and the coordinates of
the APC/C^Cdh1.Hsl1.UbcH10-Ub were docked into the EM reconstruction. The
UbcH10 coordinates fit new EM density proximal to Apc11.
Extended Data Table 1 | EM data collection and processing statistics, and structure refinement statistics

a. Statistics of APC/C^Cdh1.Emi1 structure determination

Data collection
  EM:                                  FEI Polara, 300 keV
  Detector:                            FEI Falcon II
  Pixel size (Å):                      1.36
  Defocus range (μm):                  1.5-4
Reconstruction
  Software:                            RELION 1.3
  Accuracy of rotations (degrees):     1.073
  Accuracy of translations (pixels):   0.691
  Final resolution (Å):                3.6
Refinement
  Software:                            RefMac 5.8
  Refmac weight:                       0.04
  Resolution limit (Å):                3.5
  Residue number:                      8408
  Average Fourier shell correlation:   0.8315
  R factor:                            0.222
  Rms bond length:                     0.0135
  Rms bond angle:                      1.8098
Validation
  Ramachandran plot:
    Preferred:                         7669 (92.88%)
    Allowed:                           352 (4.26%)
    Outliers:                          236 (2.86%)

b. Statistics of all cryo-EM data
Extended Data Table 2 | Summary of model building of APC/C subunits


Subunit 
Apc1


Apc2 


Apc3 


Apc4 


Apc5 


Apc6 


Apc7 


Apc8 


Apc10 
Apc11 
Apc12
Apc13 
Apc15 
Apc16 
Cdh1 
Emi1 
UbcH10 


Ubiquitin 


Length 
1,944 


822 


824 


808 


755 


620 


599 


597 


185 
84 
85 
74 
121 
110 
496 
447 
179 


76 


Domain/Region 1 
WD40 domain (1-612) 
De novo 


NTD (1-432) 
Homology. PDB: 1LDK 


TPR dimer interface 
TPR motifs 1-7 (1-535) 
Homology. PDB: 3KAE 


WD40 domain/4HB 

X-ray. PDB: 5BPW
Inserts: De novo 

NTD (1-169) 
Homology. PDB: 5BPZ 


TPR dimer interface 
TPR motifs 1-7 (1-261) 
Homology. PDB: 2XPI 
TPR dimer interface 
TPR motifs 1-3 (21- 
166) 

X ray. PDB: 3FFL 

TPR dimer interface 
TPR motifs 1-7 (1-287) 
Homology. PDB: 3ZN3. 
Doc homology (2-162) 
X-ray. PDB: 1JHJ 
B-strand (1-18) 
Homology. PDB: 1LDK 
N-term (1-25) 

X-ray. PDB: 3HYM 

De novo 

De novo 

De novo 

NTD (1-163) 

De novo 

D box (319-335) 

X ray/De novo 

UBC domain (30-179) 
X-ray. PDB: 1I7K
X-ray. PDB: 4AP4 


Domain/Region 2 
Mid-N (613-986) 
Mid-C (1617-1944) 
De novo 


CTD(433-822) 
Homology. PDB: 1LDK 


TPR superhelix 
TPR motifs 8-14 (536-824) 
Homology. PDB: 2XPI 


TPR superhelix 

TPR motifs 1-13 (206-755) 
Homology. PDB: 2XPI 
Inserts: De novo 

TPR superhelix 

TPR motifs 8-14 (262-620) 
Homology. PDB: 3HYM 
TPR dimer interface 

TPR motifs 4-7 (167-359) 
Homology. PDB: 2XPI 


TPR superhelix 

TPR motifs 8-14 (288-597) 
Homology. PDB: 2XPI 

IR tail (163-185) 

De novo 

RING domain (21-84) 
X-ray. PDB: 4R2Y 


WD40 domain (169-470) 
Homology. PDB: 4BH6 
ZBR (356-372) 

NMR. PDB: 2M6N 


PDB accession numbers are listed. Phosphorylation site data from refs 71-73. 


Domain/Region 3 

PC domain (1013-1616) 
Homology. PDB: 4ADY 
Inserts: De novo 


TPR superhelix 
TPR motifs 8-14 (360-599) 
Homology. PDB: 2XPI 


IR tail (483-496) 

De novo 

Linker (356-372), De novo 
LR tail (431-447), De novo 


Disordered regions 
1-10,59-70, 135-145, 193-205 ,227- 
235,296-401 ,415-423,514-581,671- 


756,815-838,883-923,987-1012,1334- 
1346, 1438-1451,1712-1726, 1930-1944 


1-14, 30-52,66-69, 103-106, 192-195, 
221-231,305-319,444-450,459-475 
731-822 (WHB) 

171-450, 767-824 (A) 
171-450,782-824 (B) 


429-438,458-469, 758-808 


1-26,170-205 


96-126,528-620 (A) 
96-123,534-620 (B) 


1-35,111-131,553-599 (A) 
1-35,111-131,541-599 (B) 


1-25,501-508,558-597 (A) 
1-25,501-508,558-597 (B) 


26-85 


40,47,68-79 
57-121 

1-51 

1-41 ,68-87,109-124,133-145,164- 
168,471-482 
1-318,336-355,373-430 


1-29 
Phosphorylation sites
202,286,291 341,343,355, 
362,364,373,377,537,571, 
681, 688 


218,314,532,534 

205,209,220,244,264,291, 
302,304,313,334,336,364, 
366,369, 383,384,386,426, 


430,435,438,446,447 
469,777 


15,232,195 


112,490,559,560,580,581, 
585,586,599 


51,57,64,119,120,125 


267,556,576,578,582,590 
Extended Data Table 3 | Inter-subunit contact surface areas, DALI results and surface plasmon resonance data for APC/C^Cdh1.Hsl1 (ternary APC/C), with dissociation constants for UbcH10 (ref. 4) and UbcH10^LR (this work).


a. Inter-subunit contact surface area (Å²)


Subunit Apc1 Apc2 Apc3 Apc4 Apc5 Apc6 Apc7 Apc8 Apc10 Apc11 Apc12 Apc13 Apc15 Apc16 Cdh1 Emi1
Apc1

Apc2 2520 

Apc3 360 0 5746 

Apc4 ty) 714 ty) 

Apc5 8744 0 ) 6226 

Apc6 1056 0 4590 0 0 5244 

Apc7 0 0 5354 ty) 0) ty) 5026 

Apc8 4290 0 ty) 0 3722 3772 0 4558 

Apc10 2616 0 2510 0 ) ty) 0 ) 

Apct1 ty) 3402 ) 0 ft) 0 0 ) 0 

Apei11"'N og 666 0 0 0 0 0 0 0 0 

Apc12 246 0 ) 0 ) 5970 0 ) ) ) 

Apc13 0 0 836 0 ) 2068 «=O. 3240 0 ) 0 

Apc15 0 0 0 224 3446 340 0 1082 ) 0 0 0 

Apc16 40 0 2514 0 ) 222 1502 ty) 0 0 0 0 0 

Cdh1 1960 0 1906 0 0 712 0 3022 0 ) 0 0 0 0 

cah1N° 1960 0 ty) 0 ) 500 0 3022 0 ) 0 0 0 0 0 ) 
can" og 0 82 0 0 212 0 0 0 0 0 0 0 0 0 ) 
Cah” 0 0 1824 ) ) 0 0 ) ty) 0 ty) 0 0 0 0 ) 
Emit 732 762 ) 0 ty) 0 ft) 0 704 1300 0 0 0 0 754 

Total 24524 8064 25722 7164 34010 24686 _11882__ 26708 ~_5830 4702 6216 6144 5092 4278 8354 4252 


b. DALI Results 


Subunit Protein RCSB z RMSD_ Aligned Res’ ID Subunit Protein RCSB Z RMSD Aligned Res_ ID 
Idomain — match PDB (A) (N) (N) (%)  /domain match PDB (A) (N) (N) — (%) 
Apce1“*° Sc Cdh1 = 4BH6-E 15.9 4.0 259 3038 Apcé SpApc6 2xXPI 36.9 2.2 481 520 37 
Apc1° ScRpn2  4ADY-B 27.8 3.2 406 802 15 Apc7 SpApc6 2XPI 24.9 7.8 464 518 15 
Apc2‘"? HsCul3  4HXI-B 13.8 7.4 301 3467 Apc8 SpApc6 2XPI 25.5 7.2 443 518 19 
Apc2°"? HsCull 4F52-A 23.5 2.2 237 258 18 Apc3A Apc3B = 1.6 482 497 100 
Apc5"® SpApc6 2XPI-A Apc6A Apc6B - 4.0 486 503 100 
Apc3 SpApc6 2XPI-A 29.6 4.9 456 518 19 Apc7A Apc7B - 2.6 483 483 100 
Apc8A Apc8B - 3.7 389 490 100 


c. Surface Plasmon Resonance Data 


ligand Analyte Kort (S") Kon (M's) Ky (nM)  K“"g(nM) Reference 
UbcH10 Ternary 0.42 + 0.02 1.78+0.2 x10° 215 236 + 29 4 
UbcH10'" Ternary (fast) 4.0+40.2x107 1.70401x10° 31 2442 This study 
(slow) 
46+0.2x10° 1.70+0.1x10° 2345 


K_D^kin is the ratio of the observed microscopic constants.

a, Inter-subunit contact areas, calculated using CCP4. b, Pairwise structural comparison (DALI) of APC/C subunits. c, Surface plasmon resonance data. K_D^eq, equilibrium dissociation constant; K_D^kin, kinetic
dissociation constant = k_off/k_on. Standard errors of the fit are listed.
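As a worked check of the definition K_D^kin = k_off/k_on, the sketch below uses the UbcH10 row of panel c as we read it from the table: k_off = 0.42 ± 0.02 s^-1 and k_on = (1.78 ± 0.2) × 10^6 M^-1 s^-1. The power of ten is an assumption, since the exponent is illegible in the extracted table, and combining the two relative errors in quadrature is likewise our assumption rather than a documented procedure. The function name `kinetic_kd` is hypothetical.

```python
import math

def kinetic_kd(k_off, k_off_err, k_on, k_on_err):
    """Return (K_D, standard error) in molar units from the two rate constants."""
    kd = k_off / k_on
    # Assumed error model: independent relative errors added in quadrature.
    rel_err = math.sqrt((k_off_err / k_off) ** 2 + (k_on_err / k_on) ** 2)
    return kd, kd * rel_err

# UbcH10 binding to the ternary complex (values as read from panel c).
kd, err = kinetic_kd(k_off=0.42, k_off_err=0.02, k_on=1.78e6, k_on_err=0.2e6)
print(f"K_D(kin) = {kd * 1e9:.0f} +/- {err * 1e9:.0f} nM")  # prints "K_D(kin) = 236 +/- 29 nM"
```

Under these assumptions the result reproduces the 236 ± 29 nM listed for UbcH10, which supports reading the k_on exponent as 10^6.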
Galaxies at redshifts 5 to 6 with systematically low
dust content and high [C II] emission


P. L. Capak, C. Carilli, G. Jones, C. M. Casey, D. Riechers, K. Sheth, C. M. Carollo, O. Ilbert, A. Karim,
O. LeFevre, S. Lilly, N. Scoville, V. Smolcic & L. Yan


The rest-frame ultraviolet properties of galaxies during the first
three billion years of cosmic time (redshift z > 4) indicate a rapid
evolution in the dust obscuration of such galaxies. This evolution
implies a change in the average properties of the interstellar
medium, but the measurements are systematically uncertain owing
to untested assumptions and the inability to detect heavily
obscured regions of the galaxies. Previous attempts to measure
the interstellar medium directly in normal galaxies at these
redshifts have failed for a number of reasons, with two notable
exceptions. Here we report measurements of the forbidden
C+ emission (that is, [C II]) from gas, and the far-infrared emission
from dust, in nine typical star-forming galaxies about one billion
years after the Big Bang (z ~ 5-6). We find that these galaxies have
thermal emission that is less than 1/12 that of similar systems about
two billion years later, and enhanced [C II] emission relative to the
far-infrared continuum, confirming a strong evolution in the
properties of the interstellar medium in the early Universe. The
gas is distributed over scales of one to eight kiloparsecs, and shows
diverse dynamics within the sample. These results are consistent
with early galaxies having significantly less dust than typical
galaxies seen at z < 3 and being comparable in dust content to local
low-metallicity systems.

We have obtained rest-frame far-infrared (FIR; 3-1,000 μm)
measurements of nine 'normal' (~1-4 times average luminosity, L*)
galaxies and one low-luminosity quasar at redshifts of z = 5-6, about
one billion years after the Big Bang. These data were taken with an
early 20-antenna version of the Atacama Large Millimeter Array
(ALMA) in band 7 (1,090-800 μm) continuum mode with 7.5 GHz
of bandwidth, ~20 min of integration, and with the compact array
yielding a resolution of ~0.6 arcsec. One of the four side bands was
always centred on the 158 μm [C II] line, which is the dominant FIR
cooling line for neutral gas in normal star-forming galaxies and
hence is a good indicator of galaxy dynamics and the spatial extent of
the interstellar medium (ISM). The other three spectral bands yield
a measurement of the dust continuum emission at λ ~ 150 μm, which
is a good indicator of the total infrared luminosity for normal FIR
spectral energy distributions (SEDs; see Methods section 'Derivation
of physical parameters').
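Two numbers in this paragraph follow from simple relations and can be checked with a short sketch: the observed wavelength of the 158 μm line, λ_obs = λ_rest(1 + z), and the age of the Universe at the target redshifts. The age calculation assumes a flat cosmology with Ω_M = 0.3, Ω_Λ = 0.7 and a Hubble constant of 70 km s^-1 Mpc^-1, which is our reading of the simplified cosmology adopted in Methods. The function names are hypothetical.

```python
import math

C_II_REST_UM = 158.0    # rest wavelength of the [C II] line quoted in the text
H0_KM_S_MPC = 70.0      # assumed Hubble constant
OMEGA_M = 0.3           # assumed matter density parameter
OMEGA_L = 0.7           # assumed dark-energy density parameter
KM_PER_MPC = 3.0857e19
SEC_PER_GYR = 3.15576e16

def observed_wavelength_um(z, rest_um=C_II_REST_UM):
    """Observed wavelength of a line emitted at rest_um by a source at redshift z."""
    return rest_um * (1.0 + z)

def cosmic_age_gyr(z):
    """Age of a flat matter-plus-Lambda universe at redshift z (analytic form)."""
    hubble_time_gyr = KM_PER_MPC / H0_KM_S_MPC / SEC_PER_GYR
    x = math.sqrt(OMEGA_L / OMEGA_M) * (1.0 + z) ** -1.5
    return hubble_time_gyr * 2.0 / (3.0 * math.sqrt(OMEGA_L)) * math.asinh(x)

for z in (5.0, 5.5, 6.0):
    print(f"z = {z}: [C II] observed at {observed_wavelength_um(z):.0f} um, "
          f"age = {cosmic_age_gyr(z):.2f} Gyr")
```

With these assumptions the line is redshifted from 158 μm to roughly 950-1,100 μm, and the ages come out close to one billion years, consistent with the statements in the text.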

The objects we studied were Lyman break galaxies (LBGs) selected
from the 2 square degree Cosmic Evolution Survey (COSMOS) field;
these LBGs (with one object later discovered to be a low-luminosity
quasar) have spectroscopically determined absorption line redshifts
measured by the Deep Extragalactic Imaging Multi-Object Spectrograph
(DEIMOS) on the W. M. Keck-II Observatory in Hawaii. The
objects were selected to be 'characteristic' with luminosities between
1 and 4 L* and with ultraviolet spectral slopes (β) between −1.4 and
−0.7. Three objects (1, 2 and 10) also meet the selection criteria for
Lyman-α (Lyα) emitters at z ~ 5.7. When possible, ultraviolet
morphologies are measured using the Hubble Space Telescope
(HST) Advanced Camera For Surveys (ACS) F814W (0.8 μm) data.
Stellar masses were determined by fitting SED models to existing
COSMOS data and new 3-5 μm photometry from the Spitzer-
SPLASH survey (see Methods section 'Optical and NIR data').
Rest-frame infrared luminosities were determined by assuming the
range of FIR properties measured at z ~ 2-5 (ref. 19) and have a
systematic uncertainty of 0.3 dex due to the uncertainty in the shape
of the FIR SED (see Methods section 'Derivation of physical
parameters').

We detect four of nine galaxies in the dust continuum (Fig. 1,
Extended Data Table 1), and a mean combination (stacking) of
the data on the remaining six objects yields a continuum detection
of 35 ± 13 μJy. This is less than 1/12 of that expected for systems with
similar ultraviolet properties at z < 3 assuming the worst-case
systematics in our FIR determination (see Methods section 'Robustness
of upper limits on dust emission'). This can be quantified with the
infrared-excess to ultraviolet-slope (IRX-β) relation (Fig. 2), which
relates the amount of dust absorption measured in the ultraviolet to
the amount of infrared re-emission and is sensitive to both dust
properties and column density. We find that the measured IRX and
β for our continuum-detected objects and A1689-zD1 are consistent
with the IRX-β relation for the Small Magellanic Cloud (SMC),
which has lower-metallicity gas and less thermal FIR emission than
typical galaxies at z < 3. Our undetected systems appear to have even
less IRX than the SMC, but could be consistent given the large
systematic uncertainties. Hence, we conclude that z ~ 6 galaxies are
significantly less dust-obscured than more-evolved systems at z < 3,
confirming previous results.

The very low IRX values at β ~ −1.2 implied by our non-detections
are difficult to explain in the context of the very young systems
expected at z = 5-6 (ref. 22). The simplest explanation is the
systematics in our estimates of β and total infrared luminosity (L_IR), which
would imply relatively unobscured systems with ~45 K dust (see
Methods section 'Derivation of physical parameters'). But we cannot
exclude changes in the dust geometry or rapid disruption of the
molecular clouds that could lead to lower IRX values at a given β
with more-normal dust temperatures. Higher signal-to-noise near-
infrared photometry and shorter wavelength FIR data will ultimately
be needed to understand these sources.

A corollary of this result is that ultraviolet-derived star-formation
rates (SFRs) at high redshift (z > 5) should use the SMC-like IRX-β
relation or assume no dust rather than the currently assumed Meurer
IRX-β relation. This will decrease the ultraviolet-derived SFRs by a
typical factor of ~2-4 for individual galaxies from those implied by
the Meurer relation for similar values of β. The effect on the global
star-formation history is much smaller, <40%, because the majority of
star formation is in low-luminosity (<L*) galaxies that were already
assumed to have little or no dust extinction (see Methods section
'Effects of evolving dust on the global star-formation history',
Extended Data Fig. 4).

Figure 1 | Optical, [C II] and continuum maps of the sources HZ1-HZ10.
The [C II] line detections (red contours) and weak ~158 μm FIR continuum
detections (blue contours) are shown with the rest-frame ultraviolet images as
the background. The images are 5″ × 5″ (scale bar at bottom left) and the
contours are 2, 6 and 10σ, with [C II] line profiles for each source shown in
Fig. 3. The background images are from HST-ACS in the F814W band where
the morphologies will be affected by Lyα, except for HZ10, which is Subaru z′
band. All objects are detected in [C II], showing that a large amount of gas is
present in these systems, but only four are detected in continuum.

Figure 2 | IRX-β measurements of z > 5 objects. The deficit of infrared
emission in our sample is evident in the presented infrared excess
versus ultraviolet slope (IRX-β) relation when compared with models.
Detections are indicated in red, upper limits in orange, the mean IRX
ratio (obtained by combining undetected sources) in blue, and
A1689-zD1 in maroon. Error bars are 1σ, and include standard
measurement error and systematic uncertainty added in quadrature.
The Meurer relation, which assumes Calzetti-like dust and is
consistent with typical galaxies at z < 3, is shown as a black solid line,
while a model for lower-metallicity SMC-like dust is shown as a
dashed line.

In contrast to the dust emission, we find >3σ detections of the [C II]
line in all nine normal galaxies (Fig. 3). The line emission is spectrally
resolved in all cases, with [C II] velocity dispersions of 63-163 km s^-1,
and marginally resolved at our spatial resolution of ~0.5-0.9 arcsec,
indicating galaxies with dynamic masses (M_dyn) of ~10^9-10^11 solar
masses (M_⊙; see Methods sections 'Reduction of ALMA data' and
'Derivation of physical parameters'). We also detect two optically faint
[C II] emitters at redshifts consistent with the targeted objects. HZ5a is
detected near HZ5 at a redshift consistent with the in-falling gas seen
in the optical spectra of HZ5. HZ8W corresponds to an optically faint
companion to HZ8 and has a similar redshift. Taken together, the
direct and serendipitous detections suggest ubiquitous and enhanced
[C II] emission in early galaxies similar to that seen in local
low-metallicity systems (Fig. 4).
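The order of magnitude of the dynamical masses can be illustrated with a simple virial-type estimate, M_dyn ≈ C σ² R / G. This is only a sketch: the exact estimator is given in the Methods section 'Derivation of physical parameters', which is not reproduced here, and the prefactor C (set to 1 below) is an assumption that depends on the geometry adopted. The function name `dynamical_mass_msun` is hypothetical.

```python
# Gravitational constant in units of kpc (km/s)^2 per solar mass.
G_KPC_KMS2_PER_MSUN = 4.30091e-6

def dynamical_mass_msun(sigma_kms, radius_kpc, prefactor=1.0):
    """Virial-type mass estimate in solar masses; the prefactor is an assumption."""
    return prefactor * sigma_kms ** 2 * radius_kpc / G_KPC_KMS2_PER_MSUN

# Extremes of the quoted velocity dispersions (63-163 km/s) and gas scales (1-8 kpc).
low = dynamical_mass_msun(63.0, 1.0)
high = dynamical_mass_msun(163.0, 8.0)
print(f"M_dyn between {low:.1e} and {high:.1e} solar masses")
```

Combining the smallest dispersion with the smallest scale and the largest with the largest brackets the sample between roughly 10^9 and several 10^10 solar masses for C = 1; a prefactor of a few would raise the upper end further.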

The [C II] enhancement in local systems is caused by a lower dust-to-
gas ratio, which allows the ultraviolet radiation field to penetrate a
larger volume of molecular cloud. Our significantly lower IRX
values and enhanced [C II]/FIR ratios would suggest a similar effect
is happening in high-redshift galaxies. But other possible causes of
the transition in obscuration properties with redshift have been
suggested: evolution in metal abundances that changes the
intrinsic ultraviolet slope; changes in the dust properties; and
differences in the dust geometry. The systems in this study were
selected to have broad ultraviolet absorption features in their
spectra that indicate a relatively homogeneous metal abundance of
~0.25 times the solar value. At this metallicity, the ultraviolet
spectral slopes are expected to be similar to those of solar metallicity
systems. Furthermore, the population from which these objects were
selected has dust attenuation properties similar to those of
lower-redshift objects. So changes in the dust properties that would be
measurable in the attenuation curve are probably not the primary factor. When
compared with objects of the same stellar masses, our sample has
physical properties (gas velocity dispersions, sizes, and SFRs) that imply
geometries consistent with z ~ 1-3 galaxies that exhibit dust properties
like those found in the Meurer model. Changes in the average cloud
geometry are therefore not obvious, but we cannot fully exclude this
possibility as it would explain the observed IRX properties. So we
conclude that a significant decrease in the dust-to-gas ratio is the most likely
explanation for the evolution in extinction properties.

Figure 3 | [C II] line profiles for the sources. The line velocity profiles of our
[C II] measurements are shown for our 10 sources and two blind detections,
HZ5a and HZ8W. All sources but HZ5, a quasar, are clearly detected. For
objects HZ8 and HZ8W, a dotted line showing the integrated flux over both
sources is also plotted. The consistency between the integrated and individual
profiles indicates they are at similar redshifts.

Figure 4 | L_[C II] over L_IR ratio as a function of L_IR. Our sample has a
clear excess of [C II] emission relative to IR emission when compared with
normal lower-redshift systems (z = 0-3). Detections are indicated
in red, upper limits in orange, and the mean of undetected sources is
shown in blue. The high L_[C II]/L_IR ratio we measure indicates a lower
dust-to-gas ratio, more-diffuse clouds, and a more-diffuse ultraviolet
radiation field than seen in normal z ~ 0 galaxies. Error bars are
1σ, including standard measurement error and systematics added in
quadrature.

These results are seemingly at odds with previous measurements that
failed to detect [C II] emission at high redshift. Our sample contains
objects with both emission and absorption of Lyα, and the [C II] emission
strength is uncorrelated with the Lyα properties (Extended Data Table 1),
which means that the use of Lyα redshifts is not the cause. The most
likely explanation for the lack of detections is observational
uncertainty in the expected line flux, line width, and frequency. Even
obscuration-uncorrected ultraviolet SFRs will overestimate the expected
FIR continuum flux, because less than 20% of the star formation is
leading to FIR emission in our sample (Fig. 2). This means the FIR
emission of galaxies will be more difficult to detect than previously
expected based on z < 3 scaling relations (see Methods sections
‘Robustness of upper limits on dust emission’ and ‘Quantifying [C II]
non-detections in the literature’), so longer integration times are
required. That said, these data show [C II] is enhanced and readily
detectable with ALMA at z ~ 6 if moderate integration times are used.
Finally, we note that the [C II] lines are resolved in velocity space, so
single channel sensitivity should not be used for limits (see Methods
section ‘Quantifying [C II] non-detections in the literature’).


Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique
to these sections appear only in the online paper.

METHODS


Cosmology. Throughout this paper we assume a simplified cosmology of
Ω_M = 0.3, Ω_Λ = 0.7 and H_0 = 0.7 (in units of 100 km s⁻¹ Mpc⁻¹) when
calculating physical parameters. These values approximate the 2013 Planck
and 9-year WMAP values, and are commonly assumed in the current literature
to make comparisons easier over time as the exact measured values evolve.
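
As an illustration of how the adopted cosmology enters the physical sizes quoted later, the sketch below converts an angular size to a physical scale for a flat cosmology with the parameters above. It is a minimal example and not the authors' code: the function names, the Simpson integration and the step count are assumptions made for illustration.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s


def angular_diameter_distance_mpc(z, omega_m=0.3, omega_l=0.7, h=0.7, steps=2000):
    """Angular diameter distance (Mpc) in a flat Lambda-CDM cosmology.

    The defaults mirror the simplified cosmology stated in the text;
    `steps` (an even number) is an arbitrary choice for this sketch.
    """
    hubble_distance = C_KM_S / (100.0 * h)  # c / H0 in Mpc

    def inv_e(zp):
        # 1 / E(z) for matter plus a cosmological constant
        return 1.0 / math.sqrt(omega_m * (1.0 + zp) ** 3 + omega_l)

    # Composite Simpson's rule for the comoving distance integral
    dz = z / steps
    total = inv_e(0.0) + inv_e(z)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * inv_e(i * dz)
    comoving = hubble_distance * total * dz / 3.0
    return comoving / (1.0 + z)


def kpc_per_arcsec(z, **cosmology):
    """Physical scale in kpc subtended by one arcsecond at redshift z."""
    arcsec_in_rad = math.pi / (180.0 * 3600.0)
    return angular_diameter_distance_mpc(z, **cosmology) * 1000.0 * arcsec_in_rad


if __name__ == "__main__":
    for z in (5.15, 5.5, 5.69):
        print(f"z = {z}: {kpc_per_arcsec(z):.2f} kpc per arcsec")
```

At z ~ 5.5 this gives a scale close to 6 kpc per arcsecond, which is in line with the 5-7″ (29-43 kpc) conversion quoted in the data-reduction section.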

Reduction of ALMA data. The ALMA observatory staff, as part of standard data 
processing and delivery, performed initial data calibration. The calibrated visibility 
data were then re-analysed, and additional flagging of bad time periods in the data 
and bad channels was performed. The data were re-imaged using the CASA Briggs 
flexible weighting of the u-v visibility data with Robust = 1 to create continuum, 
moment zero, and channel maps. Gaussian fitting of the spectral and spatial data 
was performed using the CASA viewer and CASA fit tool. 

The centroid frequency of these models was compared to the rest frequency of
[C II] to measure the redshift, which was compared to the previously determined
redshifts from optical spectroscopy. Since these measures were identical, within
our precision, the reference frequency of each cube was set to the centroid
frequency, allowing the FWHM of the Gaussian to be read in km s⁻¹ and the line
flux in Jy km s⁻¹. The [C II] line contribution was subtracted from the measured
continuum fluxes.
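
The redshift measurement described above can be sketched as follows. This is an illustrative example only: the rest frequency constant (1900.537 GHz for the [C II] 158 μm line) and all function names are assumptions introduced here, and the FWHM conversion uses the FWHM = 2.35σ relation given in the footnote to Extended Data Table 1.

```python
C_KM_S = 299792.458        # speed of light in km/s
CII_REST_GHZ = 1900.537    # assumed [C II] 158 um rest frequency in GHz


def redshift_from_frequency(nu_obs_ghz, nu_rest_ghz=CII_REST_GHZ):
    """Redshift from the fitted centroid frequency of the line."""
    return nu_rest_ghz / nu_obs_ghz - 1.0


def sigma_to_fwhm(sigma_kms):
    """Convert a Gaussian velocity dispersion to FWHM (FWHM = 2.35 sigma)."""
    return 2.35 * sigma_kms


def velocity_offset_kms(z_a, z_b):
    """Velocity offset of redshift z_a relative to redshift z_b."""
    return C_KM_S * (z_a - z_b) / (1.0 + z_b)


if __name__ == "__main__":
    # Hypothetical centroid frequency chosen to correspond to z = 5.6566
    nu_obs = CII_REST_GHZ / (1.0 + 5.6566)
    print(f"observed frequency: {nu_obs:.2f} GHz")
    print(f"recovered redshift: {redshift_from_frequency(nu_obs):.4f}")
    # Offset between the tabulated UV and [C II] redshifts of HZ10
    print(f"UV vs [C II] offset: {velocity_offset_kms(5.659, 5.6566):.0f} km/s")
```

The last line compares the two redshifts listed for HZ10 in Extended Data Table 1 as a simple consistency check.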

The measurements derived from the ALMA data are given in Extended Data
Table 1 and Gaussian fits done with CASA to the image components are given in
Extended Data Tables 2 and 4 for the [C II] and continuum data, respectively. The
fits were not constrained by the beam size, and so they can be slightly smaller due
to the data errors. The errors quoted are those produced by the CASA Gaussian
fitting routine and are sensitive to the box size chosen for the fit, especially in cases
of low signal to noise. Deconvolved size measurements for the [C II] lines are given
in Extended Data Table 3. Only objects 3 and 5a are unresolved in [C II]. In
contrast, only object 10 is resolved in the continuum measurement. The baseline
configuration used will resolve out spatial scales larger than 5-7″ (29-43 kpc) but
none of our sources appear to have emission on that large a scale.

Optical and NIR data. These were obtained from the COSMOS photometric
redshift catalogue augmented with additional data from the Spitzer-Large
Area Survey with Hyper-Suprime-Cam (SPLASH). Sizes were taken from the
COSMOS Hubble Space Telescope Advanced Camera for Surveys (HST-ACS)
weak lensing catalogue and are given in Extended Data Table 5. Object 10 falls
outside the area covered by HST-ACS.

Derivation of physical parameters. Since only one FIR data point was available,
total infrared (3-1,100 μm) luminosity was derived using the range of grey body
models covering the full range of objects observed at z ~ 2-5. The grey
body model parameters, α the blue power-law, β the long wavelength slope,
and T the black body temperature, ranged between α = 1.5-2.5, β = 1.2-2.0
and T = 25-45 K. The range of allowed infrared luminosities was determined
by scaling these grey bodies to the observed rest frame 158 μm flux. This range of
parameters resulted in a systematic uncertainty of 0.3 dex in the derived
luminosity. This relatively small error is due to the fact that 158 μm is on the flat
part of the SED close to the emission peak for the range of observed physical
parameters. The derived infrared luminosities are given in Extended Data Table 5.

The rest frame ultraviolet properties were determined by fitting a power-law to
the rest frame 0.13-0.3 μm photometry from the COSMOS and Ultra-Vista
surveys. This power-law fit was then used to estimate the ultraviolet spectral
slope (β) and the luminosity at 0.16 μm, where $L_{\rm UV} = \nu_{1600} L_{\nu,1600}$.
The power-law fits appear to be biased high by the bluest bands in several cases,
which may be due to the intrinsic SED flattening at short wavelengths. To account
for this possible systematic, we also fit the slope using only the rest-frame
0.15-0.3 μm photometry and include the shifts in our measurement error bars.
Finally, simulations of observational biases from blending and signal-to-noise
indicate that an additional ±0.3 systematic error should be included. These
estimates are tabulated in Extended Data Table 5.
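
A power-law slope fit of the kind described above can be sketched in a few lines. The example below is a simplified illustration, not the fitting code used for the paper: it assumes the common convention f_λ ∝ λ^β, performs an unweighted least-squares fit in log space, and uses synthetic photometry; the function names and the input values are hypothetical.

```python
import math


def fit_uv_slope(wavelengths_um, f_lambda):
    """Unweighted least-squares fit of log10(f_lambda) = beta * log10(lambda) + c.

    Returns (beta, c). Assumes the convention f_lambda proportional to
    lambda**beta and ignores photometric errors for simplicity.
    """
    xs = [math.log10(w) for w in wavelengths_um]
    ys = [math.log10(f) for f in f_lambda]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = sxy / sxx
    intercept = mean_y - beta * mean_x
    return beta, intercept


def flux_at(wavelength_um, beta, intercept):
    """Evaluate the fitted power law at a given rest-frame wavelength."""
    return 10.0 ** (intercept + beta * math.log10(wavelength_um))


if __name__ == "__main__":
    # Synthetic rest-frame photometry following f_lambda = 2.0 * lambda**-1.4
    bands = [0.13, 0.16, 0.20, 0.25, 0.30]
    fluxes = [2.0 * w ** -1.4 for w in bands]
    beta, c = fit_uv_slope(bands, fluxes)
    print(f"beta = {beta:.2f}")
    print(f"f_lambda at 0.16 um = {flux_at(0.16, beta, c):.3f}")
```

Repeating the fit with the bluest band removed, as the text does with the 0.15-0.3 μm range, gives a direct estimate of how much the slope shifts.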

For consistency, we re-analysed A1689-zD1 with the same methods used for
our sample. We find L_IR = (6.8 ± 1.2) × 10¹⁰ L⊙, L_UV = (1.8 ± 0.4) × 10¹⁰ L⊙ and
β = −0.67 ± 0.78 assuming a lensing magnification of 9.3, versus the published
L_IR = (6.2 ± 0.8) × 10¹⁰ L⊙ and L_UV = 1.8 × 10¹⁰ L⊙. No value was published for β
derived from the photometry, but our analysis would not have produced a
consistent L_UV if the two analyses of β were inconsistent, because L_UV is derived
from the β fit.

Stellar masses were derived using data from the Spitzer Large Area Survey with
Hyper-Suprime-Cam (SPLASH) in the same way described in previous work. In
brief, the COSMOS and SPLASH 0.1-5 μm photometry was fitted with Bruzual
and Charlot templates with a Chabrier IMF allowing for a range of emission line
strengths and extinction laws. The estimated stellar masses are given in Extended
Data Table 5.

Dynamical masses were estimated based on the [C II] velocity dispersion (σ_[C II])
and size estimates in Extended Data Table 3. We used the method outlined
in Wang et al. to estimate the dynamical masses. In this approximation
$M_{\rm dyn} = 1.16 \times 10^{5}\, V_{\rm cir}^{2}\, D$, where V_cir is the circular velocity in km s⁻¹ and D
is the diameter in kpc. $V_{\rm cir} = 1.763\, \sigma_{\rm [C\,II]} / \sin(i)$, and i, the disk inclination
angle, is approximated by the axis ratio of the object as $i = \cos^{-1}(b/a)$.
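
The dynamical mass approximation above can be written out directly. The sketch below is illustrative only: it assumes that the result is in solar masses and that D is taken as the de-convolved major axis in kpc, neither of which is stated explicitly in the text, and the function name is hypothetical. The input values for HZ10 are taken from Extended Data Tables 1 and 3.

```python
import math


def dynamical_mass_msun(sigma_kms, major_arcsec, minor_arcsec, diameter_kpc):
    """Dynamical mass from M_dyn = 1.16e5 * V_cir**2 * D.

    V_cir = 1.763 * sigma / sin(i), with the inclination i estimated from
    the axis ratio as i = arccos(b / a). Units of solar masses are assumed.
    """
    inclination = math.acos(minor_arcsec / major_arcsec)
    v_cir = 1.763 * sigma_kms / math.sin(inclination)
    return 1.16e5 * v_cir ** 2 * diameter_kpc


if __name__ == "__main__":
    # HZ10: sigma = 127 km/s, de-convolved axes 0.80" x 0.28", major axis 4.7 kpc
    m_dyn = dynamical_mass_msun(127.0, 0.80, 0.28, 4.7)
    print(f"log10(M_dyn) = {math.log10(m_dyn):.2f}")
```

Under these assumptions the HZ10 result comes out close to 10.5 in log units. Replacing the axis-ratio inclination with a fixed sin(i) between 0.45 and 1 shows how strongly the geometry drives the uncertainty.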

This method yields an approximate answer but is very uncertain because of our 
low spatial resolution that leads to a large uncertainty in both i and the object size. 
The errors quoted in Extended Data Table 5 include the error in the semi-major 
axis and the velocity dispersion and assumes sin(i) = 0.45-1. This range of sin(i) 
approximates the range of values from a dispersion dominated system to an edge- 
on disk, but could be significantly larger. 

We find the dynamical masses are typically a factor of ~3 greater than the stellar
masses (see Extended Data Fig. 3). This is higher than the factor of 1.2-1.7 found at
z ~ 1-3 from similar Hα based dynamical mass estimates, but consistent given
the very large errors in our measurements.

Robustness of upper limits on dust emission. The conclusion that the dust
properties, and specifically the dust mass and dust to gas ratio, of galaxies are
evolving relies on two quantities: the measured deficit of rest-frame 158 μm flux,
and the models that translate this into a deficit of thermal emission. Here we
quantify the robustness of this result.

If we assume a Meurer et al. like IRX-β relation and the mean of our grey body
models, the predicted mean ALMA flux of the undetected sources is 833-7,610 μJy
(given the uncertainty in β), which is >61σ discrepant from the measured mean
of 35 ± 13 μJy. Individual sources are discrepant from the Meurer et al. relation
by >17-490σ except for objects 9 and 10 which are consistent given the large
errors in β. If we assume the maximal range of the systematic error, 45 K dust,
β = 1.5, α = 1.2, the predicted mean ALMA flux is 447-4,087 μJy, which is >32σ
discrepant from the measured mean. Finally we consider an extreme model
with 100 K dust, β = 1.5, α = 1.2; we predict an ALMA flux of 78-710 μJy, which
is 3σ discrepant from the measured mean. In this extreme case, objects 4, 9 and 10
are consistent with the Meurer et al. relation, and the remaining sources are
discrepant at the >1-30σ level.
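
The quoted significance levels follow from simple arithmetic on the numbers in this section. The short sketch below reproduces them from the lower bound of each predicted flux range and the measured mean of 35 ± 13 μJy; the function name and the case labels are introduced here for illustration, and only the measurement error is considered.

```python
def discrepancy_sigma(predicted_ujy, measured_ujy, error_ujy):
    """Discrepancy between a predicted and a measured flux in units of sigma."""
    return (predicted_ujy - measured_ujy) / error_ujy


if __name__ == "__main__":
    measured, error = 35.0, 13.0  # measured mean flux and its 1-sigma error (uJy)
    # Lower bounds of the predicted mean ALMA flux for each dust model (uJy)
    lower_bounds = {
        "mean grey body": 833.0,
        "45 K dust": 447.0,
        "100 K dust": 78.0,
    }
    for label, predicted in lower_bounds.items():
        sigma = discrepancy_sigma(predicted, measured, error)
        print(f"{label}: {sigma:.1f} sigma")
```

Using the lower bound of each range gives the minimum discrepancy, which is why the first two cases are quoted as lower limits.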

We also performed a completeness analysis following Casey et al. and find the
low dust emission is unlikely to be due to selection effects in our ultraviolet sample.
However, we note the result only holds for ultraviolet selected samples. The
general galaxy population may contain heavily obscured objects that may not be
easily found by ultraviolet selected surveys.

Quantifying [C II] non-detections in the literature. Our sample predicts a peak
[C II] line flux of ~0.2-0.7 mJy for an L* (ref. 40) galaxy with an SFR ~ 6.5 at
z = 6-7, below the detection limit of most previous studies of z ~ 6-7 objects.

In addition to sensitivity, we note Lyα redshifts are systematically offset by
~400 km s⁻¹ and up to 1,200 km s⁻¹ from the [C II] (and systemic absorption
line) redshifts, which is a significant fraction of the ALMA band pass.

Finally, caution must be used when interpreting the limits given in the
SFR-L_[C II] relation because the estimated line luminosity limit scales as the
square root of the assumed line width. A 36-100 km s⁻¹ FWHM channel width is
typically used for these limits, but our measured line widths in Extended Data
Table 1 are significantly broader. If we use our measured line widths, this would
predict [C II] line limits 0.2-1.0 dex higher than those typically quoted, making
them consistent with the SFR-L_[C II] relation.

Comparison of gas and SFR properties to general redshift scaling relations.
This sample is enhanced in L_[C II] relative to L_IR compared with lower-redshift
samples (Fig. 4). These sources have ~1% of their luminosity emitted in the [C II]
line, consistent with low metallicity and extended star forming objects.

This sample appears to follow the local SFR-L_[C II] relation if the SFR is
determined by summing the ultraviolet and FIR measurements rather than using a
dust corrected ultraviolet estimate (Extended Data Fig. 1). However, if an
ultraviolet derived dust correction is used, the points shift ~0.5-1 dex higher.

Finally we turn to the stellar-mass-SFR or ‘main-sequence’ relation (Extended
Data Fig. 2). We find our objects generally agree with this relation, but several
objects fall below the relation. Specifically, objects HZ1, HZ2 and HZ3 are below it,
but HZ1 has an unusually low dynamical to stellar mass ratio, indicating the stellar
mass may be over-estimated.

Effects of evolving dust on the global star-formation history. Our results
indicate the properties of dust and the amount of extinction evolve significantly
between z ~ 3, where it is well measured, and z ~ 5-6, where we measure it.
To estimate the effect on the global star-formation history, we adopt the
luminosity functions and ultraviolet slope measurements from the series of papers
by Bouwens et al., which are largely supported by other work. We then assume
the dust correction can be described by the Meurer curve at z < 4, the SMC
curve at z ~ 5, and the SMC curve scaled to our measured IRX at z ~ 6-8.
The results of this analysis are shown in Extended Data Fig. 4 and result in
~30-40% less star-formation at z ~ 6 than previously thought. For consistency
with the literature, the SFR density is shown assuming a Salpeter IMF; a factor of
0.68 needs to be applied to convert to a Chabrier IMF.

Sample size. No statistical methods were used to predetermine sample size. 
Randomization. The experiments were not randomized. 

Blinding. The investigators were not blinded to allocation during experiments 
and outcome assessment. 
Extended Data Figure 1 | [C II] flux as a function of total star-formation
rate, SFR. Our sample is consistent with the low-z SFR-L_[C II] relation from the
literature at z = 0-6 (ref. 6). The best fit to the literature points from ref. 44
is indicated with a solid line. The SFRs are derived using the UV + FIR method
and a Chabrier IMF. The dust-corrected ultraviolet estimates with a Meurer
relation typically used at these redshifts would increase the estimated SFR by
a factor of ~2-10 (0.5-1 dex), leading to over-estimates of the expected
[C II] flux. The L_[C II] error bars are 1σ standard measurement error, while the
SFR errors are 1σ from a combination of measurement error in L_IR and L_UV
converted to star formation added in quadrature.

Extended Data Figure 2 | Star-formation rate as a function of stellar mass.
Our objects are generally consistent with the stellar mass-SFR or ‘main
sequence’ of star forming galaxies at z ~ 5-6 (ref. 18). Three objects fall below
the relation, namely HZ1, HZ2 and HZ3, but the stellar mass of HZ1 may
be over-estimated (‘Derivation of physical parameters’ in Methods). Mass
errors reflect the 1σ range of masses allowed by the SED model fits including
emission line strength as a free parameter. SFR errors are 1σ from a
combination of measurement error in L_IR, systematics in L_IR, and measurement
error in L_UV converted to star formation added in quadrature.

Extended Data Figure 3 | Dynamical mass compared with stellar mass. A
comparison of the dynamical masses estimated from the [C II] line and stellar
masses estimated from SED fitting is shown. The dotted line indicates equal
stellar and dynamical masses. The average dynamic to stellar mass ratio is
~3, which is higher than, but consistent with, similar measurements at z ~ 1-3
(ref. 28). Errors on the dynamical masses reflect the 1σ measurement
uncertainty in the size and velocity dispersion of the sources, but the geometry
is limited to sin(i) = 0.45-1 (‘Derivation of physical parameters’ in Methods).
The errors on the stellar masses reflect the 1σ range of masses allowed by
the SED model fits including emission line strength as a free parameter.

Extended Data Figure 4 | The star-formation rate history of the Universe.
The global SFR history at z > 4 derived from UV measurements is shown for
three different assumptions about the dust obscuration in the general galaxy
population. Blue, no correction for dust (‘direct UV measurements’ in key); red,
a dust correction assuming the Meurer relation (‘UV corrected with Meurer
et al.’); green, a correction which linearly evolves in redshift between a
Meurer relation at z ~ 4, the SMC-like relation at z ~ 5, and our measured
value at z ~ 6 (‘UV corrected with evolving dust’). Note the evolving dust
correction leads to a downward revision by 30-40% at z ~ 6. Redshift error bars
reflect the binning of the data, errors in the SFR density reflect the 1σ
measurement error in the ultraviolet luminosity density.
Extended Data Table 1 | Measured source properties

ID     RA          DEC       Redshift  Selection†  Redshift   Continuum   Line flux   σ [C II]‡
                             (UV)                  ([C II])   flux (μJy)  (Jy km/s)   (km/s)
HZ1    149.971869  2.118175  5.690     LAE/I-LBG   5.6885(2)  <30         0.29±0.04   72±11
HZ2    150.517074  1.928902  5.670     LAE/I-LBG   5.6697(6)  <29         0.42±0.07   134±25
HZ3    150.039297  2.337183  5.546     I-LBG       5.5416(2)  <51         0.56±0.07   91±12
HZ4    149.618835  2.051873  5.540     BBG         5.5440(2)  202±47      1.14±0.11   108±11
HZ5§   150.214993  2.582653  5.310     V-LBG       No data    <32         <0.02       No data
HZ5a   150.214667  2.583136  No data   [C II]      5.3089(1)  <32         0.18±0.02   20±3
HZ6    150.08960   2.586411  5.290     V-LBG       5.2928(1)  129±36      1.80±0.13   95±7
HZ7    149.876986  2.134113  5.250     BBG         5.2532(4)  <36         0.71±0.07   162±18
HZ8    150.016896  2.626614  5.148     R-LBG       5.1533(2)  <30         0.34±0.03   107±9
HZ8W   150.016546  2.626731  No data   [C II]      5.1532(1)  <30         0.27±0.03   63±7
HZ9    149.965406  2.378379  5.548     R-LBG       5.5410(1)  516±42      1.95±0.07   152±6
HZ10   150.247098  1.555426  5.659     LAE/I-LBG   5.6566(2)  1261±44     1.57±0.09   127±8

RA, right ascension; DEC, declination.
* This source is the same object reported as LBG-1 in ref. 25.
† LAE, narrow band selected Lyα emitter; I-LBG, I band dropout Lyman-break galaxy; R-LBG, R band dropout Lyman-break galaxy; V-LBG, V band dropout Lyman-break galaxy; BBG, Balmer break galaxy, selected with photo-z and 4.5 μm flux; [C II], [C II] detection in these data.
‡ Velocity dispersion, FWHM = 2.35σ.
§ Chandra detected quasar, shows broad lines in spectra.
|| Redshift error is ±4 × 10⁻³.
The numbers in parentheses indicate the errors in the final digits of the redshifts.
Extended Data Table 2 | Gaussian fits to ALMA [C II] moment zero images

ID     Beam major  Beam minor  Beam PA    Source major  Source minor  Source PA
       axis (″)    axis (″)    (degrees)  axis (″)      axis (″)      (degrees)
HZ1    0.82        0.51        64.2       0.93±0.08     0.57±0.03     69.7±4.2
HZ2    0.91        0.5         64.9       1.46±0.37     0.69±0.08     79.3±5.6
HZ3    0.87        0.4         63.0       1.02±0.26     0.59±0.1      94±12
HZ4    0.90        0.48        59.7       1.22±0.15     0.61±0.05     43.1±4
HZ5    0.61        0.49        65.1       No data       No data       No data
HZ5a   0.61        0.49        65.1       0.93±0.16     0.58±0.07     120±10
HZ6    0.68        0.48        65.2       1.07±0.14     0.83±0.09     48±17
HZ7    0.71        0.49        76.8       1.19±0.22     0.65±0.08     72.9±7.9
HZ8    0.71        0.49        61.5       1.41±0.21     1.01±0.13     35±16
HZ8W   0.71        0.49        61.5       1.05±0.13     0.55±0.04     75.3±4.2
HZ9    0.66        0.53        75.8       0.94±0.07     0.66±0.04     76.2±6.1
HZ10   0.69        0.51        60.5       1.04±0.06     0.60±0.02     78.7±2.8

PA, position angle.
* Measured source sizes, not de-convolved.
Extended Data Table 3 | De-convolved sizes of [C II] moment zero images

ID     Major axis  Minor axis  PA         Major axis   Minor axis
       (″)         (″)         (degrees)  (kpc)        (kpc)
HZ1    0.45±0.24   0.19±0.1    84±59      2.6±1.4      1.1±0.6
HZ2    1.17±0.38   0.41±0.38   86±21      6.9±2.2      2.4±2.2
HZ3    <1.02       <0.59       ±180       <6.1         <3.5
HZ4    0.87±0.24   0.24±0.16   29±18      5.2±1.4      1.4±1.0
HZ5a   <0.96       <0.33       ±180       <5.8         <2.0
HZ6    0.85±0.21   0.65±0.20   35±60      5.2±1.3      4.0±1.22
HZ7    0.96±0.30   0.42±0.17   72±23      5.9±1.8      2.6±1.0
HZ8    1.24±0.28   0.84±0.23   28±48      [illegible]  5.2±1.4
HZ8W   0.79±0.19   0.19±0.14   81±12      4.9±1.2      1.2±0.9
HZ9    0.66±0.09   0.39±0.07   76±14      3.9±0.5      2.3±0.4
HZ10   0.80±0.09   0.28±0.08   85.3±5.9   4.7±0.5      1.7±0.5
Extended Data Table 4 | Gaussian fits to ALMA continuum images*

ID     Beam major  Beam minor  Beam PA    Source major  Source minor  Source PA
       axis (″)    axis (″)    (degrees)  axis (″)      axis (″)      (degrees)
HZ4    0.87        0.48        59.2       0.82±0.07     0.44±0.02     56.4±2.9
HZ6    0.71        0.50        65.5       0.72±0.06     0.45±0.02     73.9±4.2
HZ9    0.63        0.51        76.6       0.72±0.06     0.56±0.04     93±11
HZ10   0.66        0.49        61.2       1.16±0.11     0.59±0.03     83.2±3.1

* Only HZ10 is resolved in the continuum with a de-convolved size of major axis = 0.98 ± 0.13″, minor axis = 0.27 ± 0.12″, PA = 87.5 ± 5.5.
Extended Data Table 5 | Derived physical properties
ID Leun UV" Leun IR™ iy ty beun [CH] Beta® To nea a ae 
HZ1 —-11.15+0.05 <10.32 24** 8.40+0.32 -1.4120.13 1.53 10.47+0.13 9.8" 
HZ2 —-11.16+0.03 <10.3 25*6 8.56+0.41 — -0.99* 02" 0.59 10.23+0.15 10.7+18 
HZ3 11.02+0.07 <10.53 18*8 8.6740.28  -1.05+0.5 0.66 10.23+0.35 <10.4+16 
HZ4 —-11.26+0.05  11.13+0.54 51% 8.9820.22 -1.2040.36 —0.72 9.67+0.21 10.4716 
HZ5" —-11.46+0.02 <10.3 <3 <7.2 -0.77+021 0.37 No data No data 
HZ5a <10.3 <10.3 <3 8.15+0.27 No data No data No data <9.0718 
HZ6 —-11.3340.03 10.91+0.64 49°" 9.1520.17 — -1.13°028 3.36 10.17+0.15 10.6* )8 
HZ7 —-11.08+0.04 <10.35 213 8.7440.24 -1.24+0.24 0.98 9.86+0.21 10.8* 1% 
HZ8 = 11.02+0.06' — <10.26 18*8 8.4120.18 — -1.42*O 18 1.24 9.77+0.15 10.8*}8 
HZ8W 10.57+0.15" <10.26 te 8.3120.23  Nodata 1.49 No data 9.90 
HZ9 ——- 10.93+0.05  11.54+0.19 6773 9.21+0.09 -0.85* 0 SF 0.95 9.86+0.23 10.7714 
HZ10 —_11.35+0.06 _11.94+0.08 169° %* 9.13+0.13 <-0.6" No data 10.390.17 10.5* 14 


* Monochromatic measured luminosity at rest frame 1,600 Å with no correction for obscuration. M_1600 = −2.5 L_UV + 5.814.
† ±0.3 systematic uncertainty in addition to quoted statistical errors.
‡ 3-1,100 μm integrated luminosity.
§ Systematic error is an additional ±0.3 dex based on simulations.
|| The total ultraviolet luminosity for both components was measured as 11.15 ± 0.04 and split between the two components based on the ratio of their observed z′ magnitudes.
¶ Based on the 5σ non-detection in H band.
# Known X-ray detected quasar.
** The low SNR H band point is a significant outlier, error reflects range with that point included and rejected.
A giant comet-like cloud of hydrogen escaping the warm Neptune-mass
exoplanet GJ 436b

David Ehrenreich, Vincent Bourrier, Peter J. Wheatley, Alain Lecavelier des Etangs,
Guillaume Hébrard, Stephane Udry, Xavier Bonfils, Xavier Delfosse,
Jean-Michel Désert, David K. Sing & Alfred Vidal-Madjar


Exoplanets orbiting close to their parent stars may lose some fraction of
their atmospheres because of the extreme irradiation. Atmospheric mass
loss primarily affects low-mass exoplanets, leading to the suggestion that
hot rocky planets might have begun as Neptune-like, but subsequently lost
all of their atmospheres; however, no confident measurements have hitherto
been available. The signature of this loss could be observed in the
ultraviolet spectrum, when the planet and its escaping atmosphere transit
the star, giving rise to deeper and longer transit signatures than in the
optical spectrum. Here we report that in the ultraviolet the Neptune-mass
exoplanet GJ 436b (also known as Gliese 436b) has transit depths of
56.3 ± 3.5% (1σ), far beyond the 0.69% optical transit depth. The
ultraviolet transits repeatedly start about two hours before, and end more
than three hours after the approximately one hour optical transit, which is
substantially different from one previous claim (ref. 6; based on an
inaccurate ephemeris). We infer from this that the planet is surrounded and
trailed by a large exospheric cloud composed mainly of hydrogen atoms. We
estimate a mass-loss rate in the range of about 10⁸-10⁹ grams per second,
which is far too small to deplete the atmosphere of a Neptune-like planet in
the lifetime of the parent star, but would have been much greater in the
past.

Three transits of GJ 436b, which occur every 2.64 days, were
observed on 7 December 2012 (ref. 6) (visit 1), 18 June 2013 (visit 2)
and 23 June 2014 (visit 3) with the Space Telescope Imaging
Spectrograph (STIS) on board the Hubble Space Telescope (HST). A
stellar spectrum acquired using similar settings in January 2010 (ref.
17) (visit 0) was retrieved from the archive for comparison purposes.
HST data in visits 2 and 3 were complemented with simultaneous
Chandra X-ray observations. The HST data consist of time-tagged,
far-ultraviolet spectra obtained with a grating dispersing light over
the 1,195-1,248 Å domain, with a spectral resolution of ~20 km s⁻¹
at 1,215.6 Å (the Lyman-α transition of atomic hydrogen, H I).
Exposure times of 1,500 s to 2,900 s were used to observe the star
for four successive HST orbits during each visit. Each HST orbit lasts
for 96 min, during which GJ 436 is visible for 56 min before being
occulted by the Earth, yielding 40 min gaps in the data.

The most prominent spectral feature is the H I Lyman-α emission
(Fig. 1). Absorption in the blue wing of this line has been reported in
other systems, during transits of hot Jupiters. This is interpreted as
the presence of escaping hydrogen exospheres surrounding giant
planets. Tentative evidence that the Neptune-mass planet
GJ 436b possesses such an extended atmosphere was drawn from visit
1 data despite the signal being observed after one optical transit (ref. 6),
raising questions about its planetary origin. Visits 2 and 3 were carried
out to determine the signal nature.

We performed a careful analysis to check for the existence
of instrumental systematics in the data and correct for them
(see Methods). Large variations are detected over a part of the
Lyman-α line at times corresponding to the optical transit, which
cannot be explained by any known instrumental effects. The most
notable absorption occurs in the blue wing of the line for radial
velocities between −120 km s⁻¹ and −40 km s⁻¹, ~2 h before the
optical mid-transit time (‘pre-transit’), during the optical transit
(‘in-transit’) and ~1 h after the optical mid-transit time and beyond
(‘post-transit’) in the three visits (Fig. 1 and Extended Data Fig. 1).
In-transit, over half the stellar disk (56.2 ± 3.6%, 1σ) is eclipsed
(Fig. 2a). This is far deeper than any ultraviolet transits of hot
Jupiters and significantly (~9σ) deeper than the 22.9 ± 3.9% (1σ)
post-transit signal previously reported in visit 1 data (ref. 6). The ultraviolet
transit also starts much earlier (~2.7 h) than claimed previously; the
difference is mainly due to our finding of a pre-transit absorption
and updated transit ephemeris (see Methods). Visit 0 data (ref. 17) acquired
~3 years before visit 1 and ~30 h after transit show that the out-of-
transit variability is small compared with the blue-shifted signature. This
gives us confidence that the true out-of-transit baseline is measured in
Figure 1 | Evolution of the hydrogen Lyman-α emission line of GJ 436. The
line has been averaged over out-of-transit (black), pre-transit (blue), in-transit
(green) and post-transit (red) observations from individual spectra (see
Extended Data Fig. 1). The 1σ uncertainties have been propagated accordingly
from the errors calculated by the STIS data reduction pipeline. The line profile
from visit 0 is shown for comparison with a dotted grey line. The line core
(hatched region) cannot be observed from Earth because of the interstellar
medium absorption along the line of sight. Absorption signals are measured
over the interval [−120,−40] km s⁻¹ (blue dashed lines) and compared to a
control measure over the interval [+30,+200] km s⁻¹ (red dashed lines). The
velocity scale is heliocentric.
Figure 2 | Lyman-α transit light curves of GJ 436b. a, b, Data are from visit 1
(circles), visit 2 (stars), visit 3 (squares) and visit 0 (triangles). All uncertainties
are 1σ. a, The Lyman-α (Lyα) line is integrated over [−120,−40] km s⁻¹
and shows mean absorption signals with respect to the out-of-transit flux of
17.6 ± 5.2% (pre-transit), 56.2 ± 3.6% (in-transit) and 47.2 ± 4.1% (post-
transit). b, The line is integrated over [+30,+200] km s⁻¹ and shows no
notable absorption signals: 0.7 ± 3.6% (pre-transit), 1.7 ± 3.5% (in-transit) and
8.0 ± 3.1% (post-transit). With a depth of 0.69%, the optical transit (thin black
lines in a and b) is barely seen at this scale between its contact points (dotted
lines in a and b). A synthetic light curve (green) calculated from the three-
dimensional numerical simulation is overplotted on the data in a.


visits 2 and 3, whereas it is missing in visit 1. By contrast, the flux
remains stable over the whole red-shifted wing of the line (Fig. 2b).
The decrease of the red-wing flux seen (ref. 6) during the post-transit phases
of visit 1 is not reproduced during visits 2 and 3. The mean post-transit
red-shifted signal is compatible with no detection at the 3σ level.

Our combined analysis of X-ray and ultraviolet data (see Methods)
shows that stellar magnetic activity cannot explain the observed
decrease at Lyman-α. We propose that the asymmetric absorption is
caused by the passage of a huge hydrogen cloud, surrounding and
trailing the planet (Fig. 3). The planetary atmosphere is an obvious
source for this hydrogen. To produce this extinction signature, we
estimate that an ellipsoidal, optically thick cloud of neutral hydrogen
should have a projected extension in the plane of the sky of ~12 stellar
radii (R* ≈ 0.44 R⊙) along the orbital path of the planet and ~2.5 R* in
the cross direction, well beyond the planet Roche lobe radius (0.37 R*).
Since GJ 436b grazes the stellar disk during transit, we surmise that
a central transit would have totally eclipsed the star. This could
happen in the case of other red dwarfs exhibiting central transits from
planets similar to GJ 436b. Future ultraviolet observations of systems
similar to GJ 436 could potentially reveal total Lyman-α eclipses.

The radial velocity interval of the absorption signal constrains the 
dynamics of the hydrogen atoms and the three-dimensional structure 
of the exospheric cloud. The whole velocity range is in excess of the 
Figure 3 | Particle simulation showing the comet-like exospheric cloud
transiting the star, as seen from Earth. GJ 436b is the small black dot
represented at mid-transit at 0.8521 R* (ref. 26) from the centre of the star,
which is represented by the largest black circle. The dotted circle around the
planet represents its equivalent Roche radius. The colour of simulation particles
denotes the logarithm of the column density of the cloud. The transit of this
simulated cloud gives rise to absorption over the blue wing of the Lyman-α line
as shown spectrally in Extended Data Fig. 2 and by the synthetic light curve
in Fig. 2a.


planet escape velocity (~26 km s⁻¹ at the planet surface), consistent
with gas escaping from the planet. The acceleration mechanism of
hydrogen atoms escaping from highly irradiated hot Jupiters is
debated: after escaping the planets with initial velocities dominated
by the orbital velocity (~100 km s⁻¹ for GJ 436b in the host star
reference frame), atoms are subjected to the stellar radiation pressure,
interact with the stellar wind and are eventually ionized by stellar
extreme ultraviolet (EUV; 10-91.2 nm) radiation. For strong lines such
as Lyman-α, radiation pressure can overcome the stellar gravity,
repelling the escaping atoms towards the observer and producing a
blue-shifted signature. In one hot Jupiter (HD 189733b), the absorption


[Figure 4 image; axis label: Distance (R*)]


Figure 4 | Polar view of three-dimensional simulation representing a slice 
of the comet-like cloud coplanar with the line of sight. Hydrogen atom 
velocity and direction in the rest frame of the star are represented by arrows. 
Particles are colour-coded as a function of their projected velocities on the line 
of sight (the dashed vertical line). Inset, zoom out of this image to the full spatial 
extent of the exospheric cloud (in blue). The planet orbit is shown to scale 
with the green ellipse and the star is represented with the yellow circle. 
observed at large blue-shifts is best explained by charge exchange 
interaction with the stellar wind, creating energetic neutral atoms 
with large blue-shifted radial velocities. In other cases, radiation 
pressure alone explains the observed radial velocities of the 
escaping gas. 

We ran a three-dimensional numerical simulation of atmospheric 
escape to understand the origin of the absorption signature 
observed at GJ 436b. The line profile corrected for interstellar 
absorption is used to calculate the stellar radiation pressure on 
hydrogen atoms. These are released isotropically from the Roche 
lobe limit of GJ 436b. The calculation takes the orbital eccentricity 
of GJ 436b (e = 0.15) into account. The main parameters of the 
atmospheric escape model are the mass-loss rate ṁ of hydrogen lost 
by the planet and the photo-ionization rate α_EUV of hydrogen atoms. 
The model computes the structure of the escaping gas cloud, its 
radial velocity absorption signature, and the integrated light curve. 
Adjusting for the spectra at different phases (Extended Data Fig. 2), 
we determine that a family of models, with parameters in the 1σ 
ranges of ṁ ~ 10⁸–10⁹ g s⁻¹ and α_EUV ~ 8 × 10⁻⁷–3 × 10⁻⁶ s⁻¹ 
(implying neutral atom lifetimes of ~4–18 min at the distance of the 
planet), provides good fits to the data, yielding reduced χ² values close 
to unity for 544 degrees of freedom. A synthetic light curve is plotted 
in Fig. 2. The model correctly matches the ~2 h early ultraviolet 
transit ingress observed with respect to the optical transit, as well as 
the transit depth in the correct range of velocities. It provides a 
good match to the reanalysed visit 1 data. It furthermore predicts that 
the ultraviolet transit could last up to ~20 h after the optical transit, 
due to the extended hydrogen tail of the exospheric cloud. More 
ultraviolet observations will be needed to verify this prediction. 
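
The link between the photo-ionization rate and the quoted neutral atom lifetimes can be checked with a short calculation. The sketch below is illustrative only: it assumes that the quoted rates (taken here as 8 × 10⁻⁷ and 3 × 10⁻⁶ s⁻¹) are normalized to 1 au and scale with the inverse square of the distance to the star, and it assumes an orbital distance of 0.0287 au for GJ 436b. Neither assumption is stated in this section.

```python
# Sketch only: the 1 au normalization and the orbital distance are assumptions.
PLANET_DISTANCE_AU = 0.0287  # assumed orbital distance of GJ 436b (not from this section)

def neutral_lifetime_min(rate_at_1au_per_s, distance_au=PLANET_DISTANCE_AU):
    """Mean lifetime (minutes) of a neutral hydrogen atom against photo-ionization."""
    rate_at_planet = rate_at_1au_per_s / distance_au**2  # inverse-square scaling
    return 1.0 / rate_at_planet / 60.0

short_lifetime = neutral_lifetime_min(3e-6)  # highest rate, shortest lifetime
long_lifetime = neutral_lifetime_min(8e-7)   # lowest rate, longest lifetime
print(f"lifetimes: {short_lifetime:.1f} to {long_lifetime:.1f} min")
```

Under these assumptions the lifetimes fall between a few minutes and just under 20 min, in line with the ~4–18 min range quoted above.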

According to our simulation, the stellar radiation pressure counterbalances 
<70% of the star’s gravitational pull on escaping atoms, which is 
much less than in hot Jupiter systems, in which radiation pressure 
exceeds stellar gravity by factors of 3 to 5 (ref. 20). The low stellar 
radiation pressure at GJ 436b allows the formation of a large coma and 
tail of escaping atoms, co-moving with the planet although not 
gravitationally bound to it (Fig. 4). 

Atmospheric escape is involved in the possible loss of a whole population 
of irradiated exoplanets. The average mass-loss rate of 
~5 × 10⁸ g s⁻¹ at GJ 436b means that the planet loses ~0.1% of its 
atmosphere per billion years (assuming it accounts for 10% of the 
planet mass, like Neptune). This rate requires ~1% efficiency in the 
conversion of input X and EUV energy (estimated in Methods) into 
mass loss. In the past, an M dwarf such as GJ 436 would have been 
more active and the planet could have received ≲100 times more 
X-ray and EUV irradiation over ~1 Gyr, resulting in a possible loss 
of <10% of its atmosphere during the first billion years. This planet 
thus stands on the edge of considerable mass loss, leading us to surmise 
that closer-in Neptunes could have evolved more dramatically because 
of atmospheric escape. 
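
The ~0.1% per billion years figure follows from simple arithmetic, sketched below. The planet mass (about 23 Earth masses) is an assumed round value that is not given in this section; the 10% atmospheric mass fraction is the one stated in the text, and the mass-loss rate is taken as 5 × 10⁸ g s⁻¹.

```python
SECONDS_PER_YEAR = 3.156e7
EARTH_MASS_G = 5.972e27

def atmosphere_loss_percent(mass_loss_g_per_s, planet_mass_earth,
                            atmosphere_fraction=0.10, duration_yr=1e9):
    """Percentage of the atmosphere lost over duration_yr at a constant rate."""
    mass_lost = mass_loss_g_per_s * SECONDS_PER_YEAR * duration_yr
    atmosphere_mass = atmosphere_fraction * planet_mass_earth * EARTH_MASS_G
    return 100.0 * mass_lost / atmosphere_mass

# planet_mass_earth=23.0 is an assumption made for this illustration only.
loss_pct = atmosphere_loss_percent(5e8, planet_mass_earth=23.0)
print(f"atmosphere lost per Gyr: {loss_pct:.2f}%")
```

With these inputs the loss is of the order of 0.1% per billion years; a past irradiation up to ~100 times higher, at comparable efficiency, would scale this to the order of 10%.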

This 16σ detection implies that large atmospheric signals from 
comet-like exospheres around moderately irradiated, low-mass planets 
could be retrieved in the ultraviolet. Over ~10,000 systems similar 
to GJ 436 will be discovered by upcoming transit surveys carried out 
from the ground and from space—for example, with the K2, CHEOPS, 
TESS and PLATO missions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Observation log. This work is mainly based on observations acquired in the 
framework of the joint HST-Chandra programme “Properties and dynamics of 
the upper atmosphere of the hot-Neptune GJ 436b” (GO number 12965; principal 
investigator: D.E.). The programme consisted of three HST/STIS transit observa- 
tions, with contemporaneous coverage of the pre-transit phases with Chandra/ 
ACIS-S. During the first joint HST-Chandra visit, initially scheduled on 16 
February 2013, we only obtained Chandra data as a last-minute mission sched- 
uling change prevented the HST observation. It was rescheduled, together with a 
new Chandra observation, on 18 April 2013. However, no useful HST data could 
again be obtained on this date because of a target pointing problem. Meanwhile, 
the simultaneous Chandra observation was correctly performed. The remaining 
two HST-Chandra visits were smoothly carried out on 18 June 2013 and 23 
June 2014. 

We complement our new data sets with archive HST/STIS observations of GJ 
436: (1) a short observation of the star obtained ~30 h after the planetary transit on 
5 January 2010 (HST programme GO/DD number 11817 entitled “Detecting the 
upper atmosphere of a transiting hot Neptune: a feasibility study”; principal 
investigator: D.E.; ref. 17); and (2) a transit observation obtained on 7 December 2012 
(HST programme GTO number 12034 entitled “Brown dwarf activity part 2”; 
principal investigator: J. C. Green). The 7 December 2012 transit observation was 
previously published (ref. 6) and is discussed later. All HST and Chandra data are listed 
in Extended Data Table 1. 

Similarly, we add archive X-ray data of GJ 436 to our new Chandra observations. 
These are (1) ROSAT All-Sky Survey data (ref. 27) and (2) XMM-Newton data (ref. 28) 
(observation identification number 556560101; principal investigator: P.J.W.). 
Data analysis and correction of systematics. Because Lyman-α observations are 
photon-starved, the exposures are as long as the visibility of the target by HST 
allows. This yields spectra with exposure times of ~1,500 s to ~2,900 s, separated 
by ~40 min gaps. The data are acquired in time-tagged mode, which means that 
the arrival time of each photon hitting the detector is recorded by the far-ultra- 
violet multi-anode microchannel array (FUV-MAMA) of STIS. It is thus possible 
to split the exposures into several sub-exposures. We adjusted the sub-exposure 
time to obtain a good compromise between time sampling and signal-to-noise 
ratio. (The time sampling is important to assess systematic effects, as described 
later.) During visit 1, the total exposure time was 1,515 s (orbit 1) and 2,905 s 
(orbits 2 to 4). The first orbit in a visit always has less science time available because 
of the target acquisition. We used 4 × 380 s sub-exposures (orbit 1) and 5 × 581 s 
sub-exposures (orbits 2 to 4). During visits 2 and 3, the total exposure time was 
1,670 s (orbit 1) and 2,071 s (orbits 2 to 4). This was subdivided into 4 × 417 s 
sub-exposures (orbit 1) and 4 × 518 s sub-exposures (orbits 2 to 4), respectively. 
Extended Data Figs 3-5 show these time series, which are binned by a factor of 
2 in Fig. 2 in the main text. 
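
Because each photon carries its own arrival time, an exposure can be re-binned after the fact. The following sketch illustrates the idea on a hypothetical photon list (randomly generated, not real STIS data); the function name and its arguments are illustrative and do not correspond to any pipeline tool.

```python
import random

def split_time_tagged(arrival_times, exposure_s, n_sub):
    """Count photons in n_sub equal-length sub-exposures of one exposure."""
    width = exposure_s / n_sub
    counts = [0] * n_sub
    for t in arrival_times:
        if 0.0 <= t <= exposure_s:
            # min() keeps a photon arriving exactly at the end in the last bin
            index = min(int(t // width), n_sub - 1)
            counts[index] += 1
    return width, counts

random.seed(1)
# Hypothetical photon list: 2,000 arrival times within a 2,905 s exposure.
photons = [random.uniform(0.0, 2905.0) for _ in range(2000)]
width, counts = split_time_tagged(photons, exposure_s=2905.0, n_sub=5)
print(width, counts)
```

Splitting a 2,905 s exposure into five parts gives the 581 s sub-exposures quoted above; shorter sub-exposures improve the time sampling at the cost of fewer photons per bin.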

Variation of the temperatures in the optical telescope assembly and on the focal 
plane of HST during its revolution around the Earth is surmised to cause small 
changes of focus (for example, due to variation in the distance between the two M1 
and M2 mirrors). For a given line of sight (hence for a given illumination pattern of 
the telescope), these variations depend on the HST orbital phase. They are con- 
sequently reproducible over several consecutive HST orbits, which means that 
their effect on the data, when present, can be corrected. This effect, known as 
the ‘telescope breathing’ effect, has been observed and corrected in STIS/CCD 
optical data as well as in STIS/FUV-MAMA Lyman-α data. 

We use the red part of the Lyman-α line, within the velocity range [+20, +250] 
km s⁻¹, as a probe for the systematic effects in general, and for the breathing effect 
in particular. In fact, the red peak and wing of the line are the only part of the 
spectrum unaffected (a priori) by the planet absorption and where there is enough 
flux to detect time-correlated effects. For each HST visit, we generate a raw light 
curve by integrating the Lyman-α flux over the velocity range given earlier, using 
the time-tag split exposures. These light curves are plotted in Extended Data Figs 
3a, 4a and 5a, for visits 1, 2 and 3, respectively. If present in the data, the breathing 
effect would appear as a correlation in the light curves phase-folded over the HST 
orbital period (96 min). The phase-folded light curves for the three visits are shown 
in Extended Data Figs 3b, 4b and 5b. We then attempt to fit a polynomial to each 
phase-folded light curve and examine the dispersion of the residuals in the 
observed minus calculated data. When a significant correlation is found (as in 
visits 2 and 3), a phase-dependent correcting coefficient is applied to the whole 
spectrum. 
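
The model-selection step described above can be sketched as follows. The example compares a constant with a straight line fitted to a phase-folded light curve and applies a correction only if the line is preferred. It uses synthetic data and assumes Gaussian errors with BIC = χ² + k ln n; the exact BIC convention and fitting code used for the analysis are not given in the text, so this is an illustration rather than the actual procedure.

```python
import math

def weighted_line_fit(x, y, sigma):
    """Closed-form weighted least-squares fit of y = a + b * x."""
    w = [1.0 / s**2 for s in sigma]
    sw = sum(w)
    sx = sum(wi * xi for wi, xi in zip(w, x))
    sy = sum(wi * yi for wi, yi in zip(w, y))
    sxx = sum(wi * xi * xi for wi, xi in zip(w, x))
    sxy = sum(wi * xi * yi for wi, xi, yi in zip(w, x, y))
    b = (sw * sxy - sx * sy) / (sw * sxx - sx**2)
    a = (sy - b * sx) / sw
    return a, b

def chi_square(y, model, sigma):
    return sum(((yi - mi) / si) ** 2 for yi, mi, si in zip(y, model, sigma))

def bic(chi2, n_params, n_points):
    """Bayesian information criterion, assuming Gaussian errors."""
    return chi2 + n_params * math.log(n_points)

# Synthetic phase-folded light curve with a linear trend (illustrative values).
phase = [i / 11 for i in range(12)]
flux = [1.0 + 0.2 * (p - 0.5) + 0.01 * (-1) ** i for i, p in enumerate(phase)]
sigma = [0.02] * len(phase)

mean_flux = sum(flux) / len(flux)  # equal errors, so the plain mean is the best constant
bic_const = bic(chi_square(flux, [mean_flux] * len(flux), sigma), 1, len(flux))

a, b = weighted_line_fit(phase, flux, sigma)
line = [a + b * p for p in phase]
bic_line = bic(chi_square(flux, line, sigma), 2, len(flux))

# Apply a phase-dependent correction only when the line is preferred.
if bic_line < bic_const:
    corrected = [f / (m / mean_flux) for f, m in zip(flux, line)]
else:
    corrected = list(flux)
print(bic_const, bic_line)
```

A lower BIC for the straight line indicates that the extra parameter is justified, which is the situation described for visits 2 and 3; when the constant is preferred, as for visit 1, the data are left uncorrected.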

For visit 1 data, we do not notice any significant correlation and find that a 
constant (0th-order polynomial) provides a better fit to the data (χ² = 8.92 for 16 
degrees of freedom with a Bayesian information criterion (BIC) of 17.04) than any 
higher-order polynomials. Consequently, we do not apply any breathing correction 
for this visit. For visit 2 data, we do find an improvement (over a constant) in 
the fit of the phase-folded light curve with straight lines (χ² = 0.88 for 14 degrees of 
freedom, BIC = 7.12) and apply this correction to the raw data. For visit 3 data, we 
find a linear correlation in the phase-folded light curve. We correct the raw data 
with the best-fit straight lines (χ² = 2.23 for 14 degrees of freedom, BIC = 12.7). 
The light curves corrected for breathing effect are shown in Extended Data Figs 4c 
and 5c for visits 2 and 3, respectively. 

After having corrected for HST orbital phase-dependent effects, we now look at 
instrumental effects occurring on the timescale of a full HST visit (four HST 
orbits). Such effects typically include ‘ramps’ linked with long-term thermal 
relaxation of HST instruments. Our starting point is the Lyman-α red-wing light 
curves corrected for the breathing effect. Visit 1 still does not show any time 
correlation. In contrast, the visit 2 and 3 data show hints of a flux linearly 
decreasing with time. We chose to correct for this trend by fitting a straight line 
to the data. This procedure yields a time-dependent correction coefficient for each 
visit. The final light curves corrected for (divided by) these trends are shown in 
Extended Data Figs 4d and 5d. The correction coefficients are applied to all spectra 
in visits 2 and 3. The data analysis and interpretation presented in the main text are 
based on these corrected spectra. These spectra are plotted for each visit in 
Extended Data Fig. 1. They clearly show that the signal is reproduced at each 
epoch, which strengthens our detection. In the main text, Fig. 1 presents these 
spectra averaged over visits 1, 2 and 3. 

The amplitude of the correction of systematic effects performed as described 
earlier, either at the timescale of one HST orbit (breathing) or one HST visit 
(normalization), is small with respect to the absorption signal detected on the blue 
part of the Lyman-α line. The blue absorption would show up around the transit 
epoch almost unchanged, even in the absence of both corrections described earlier. 
Finally, the absorption signal by the exospheric cloud is so large that it can be 
measured directly in physical flux units. Our detection is thus independent of any 
normalization of the transit light curve. 
Determination of the velocity interval of the absorption signature. In each 
individual spectrum of a given HST visit, we searched for the most significant (best 
signal-to-noise ratio) absorption signature. We scanned the velocity range 
[−250, +250] km s⁻¹, excluding the low signal-to-noise-ratio region between 
−30 km s⁻¹ and +20 km s⁻¹, for absorption signals extending over at least three 
contiguous elements of resolution. The signals are estimated from the difference 
between the spectrum and the reference out-of-transit spectrum for each visit. For 
visits 2 and 3, we simply used the data collected during the first HST orbits. We 
searched for both absorption and emission signals, but found only significant 
(>3σ) absorption features. We found that the range [−120, −40] km s⁻¹ 
captures most of the absorption features detected during these two visits. For 
visit 1, where the first orbit is already absorbed (since it is in the pre-transit phase), 
we could not rely on contemporaneous out-of-transit data for reference. We thus 
adopted the range used for visits 2 and 3. 
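
For orientation, the velocity intervals used here map onto wavelength through the Doppler formula. The sketch below uses the non-relativistic approximation and the H I Lyman-α rest wavelength of 1215.67 Å; the function names are illustrative.

```python
C_KM_S = 299_792.458          # speed of light in km/s
LYA_REST_ANGSTROM = 1215.67   # H I Lyman-alpha rest wavelength

def velocity_to_wavelength(v_km_s):
    """Wavelength (angstrom) for a radial velocity; negative means blue-shifted."""
    return LYA_REST_ANGSTROM * (1.0 + v_km_s / C_KM_S)

def wavelength_to_velocity(wavelength_angstrom):
    """Inverse of velocity_to_wavelength."""
    return C_KM_S * (wavelength_angstrom / LYA_REST_ANGSTROM - 1.0)

blue_edge = velocity_to_wavelength(-120.0)
red_edge = velocity_to_wavelength(-40.0)
print(f"[-120, -40] km/s spans {blue_edge:.3f} to {red_edge:.3f} angstrom")
```

The adopted absorption interval is therefore only about a third of an ångström wide, which is why the search is expressed in resolution elements rather than in wavelength.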

On the basis of the assumption that observations obtained during the first HST 
orbit of visit 1 are not absorbed, ref. 6 measured a 22.9 ± 3.9% (1σ) absorption 
signal, with respect to this first HST orbit, over the range of [−215, −17] km s⁻¹. 
We verified that we would obtain a similar result (~24%) by making the same 
assumptions as in ref. 6. However, as we show here, the first exposures obtained 
during visit 1 already exhibit strong absorption by the exospheric cloud, and 
choosing the out-of-transit references of visits 2 and 3 leads to a much deeper 
absorption signal being measured. 
Transit ephemeris. Several ephemerides of GJ 436b transit have been published in 
the literature (refs 6, 26, 30–32) and we list them in Extended Data Table 2. The visit 1 
transit observation has been published previously (ref. 6), with the following 
ephemeris values reported: P = 2.643850(90) days and T0 = 2,454,279.436714(15), for the 
period and mid-transit time, respectively. The authors attribute the period value to a 
previous study (ref. 30), although the values eventually reported in these two studies 
(refs 6, 30) are different, as can be seen in Extended Data Table 2. 

We recalculate the transit phases of the visit 1 observation using one of the most 
recent available ephemerides (ref. 26). After taking into account the ~6 min offset 
between the HST time values expressed in modified Julian days (MJD) and the 
barycentric Julian days (BJD), we obtained the phase values for the four HST 
orbits of the visit 1 observation (−1.74 h, −0.35 h, +1.25 h and +2.85 h). The 
obtained values are all ~42 min in advance of the phase values reported by ref. 6 
(−1.05 h, +0.35 h, +1.95 h and +3.54 h). Trying to understand where this 
substantial difference came from, we realized that we would obtain transit 
phases similar to those of ref. 6, offset by ~6 min, when using the T0 value of ref. 30 and the period 
given by ref. 6, which does not correspond to any value previously reported in the 
literature. The remaining ~6 min difference presumably comes from our accounting 
for the delay between BJD and MJD_UTC. 
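
The size of this discrepancy can be checked from the values listed in Extended Data Table 2. The sketch below propagates two of the tabulated ephemerides (the rows labelled Ref. 26 and Ref. 6) to the transit of 7 December 2012 and compares the predicted mid-transit times. It is a rough consistency check under stated simplifications, not the procedure used for the analysis: the few-second HJD/BJD difference is ignored, and the reference epoch is simply the Julian date of noon on the observation day.

```python
def nearest_mid_transit(t0, period, t_ref):
    """Mid-transit time (JD) of the transit closest to t_ref for a linear ephemeris."""
    n_orbits = round((t_ref - t0) / period)
    return t0 + n_orbits * period

T_REF = 2456269.0  # JD of 2012-12-07 12:00 UT, used only to select the transit

# Values copied from Extended Data Table 2.
recent = nearest_mid_transit(2454865.083208, 2.6438979, T_REF)  # Ref. 26 row
older = nearest_mid_transit(2454415.62074, 2.643850, T_REF)     # Ref. 6 row

offset_min = (recent - older) * 24 * 60
print(f"difference in predicted mid-transit: {offset_min:.1f} min")
print(f"after a ~6 min time-standard offset: {offset_min - 6:.1f} min")
```

Most of the difference comes from the small period discrepancy accumulated over roughly 700 orbits. Once the ~6 min offset between the time standards is removed, the result is close to the ~42 min shift in transit phase described above.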

New interpretation of a previous analysis of the visit 1 data. The 42 min offset 
has two important consequences with respect to the visit 1 data interpretation by 
ref. 6. These authors indeed detected an absorption signature starting at the very 
end of the optical transit, with a maximum ultraviolet absorption depth of 22.9 ± 
3.9% (1σ) occurring post-transit (~2 h after the end of the optical transit), according 
to their ephemeris. Shifting their data points 42 min earlier brings the 
start of their absorption signal back in-transit. The new phase of the first HST orbit in 
visit 1 is now 1.74 h before the optical mid-transit time, which corresponds to the 
pre-transit phases in visits 2 and 3. Since we observe a significant (>3σ) absorption 
signal of 17.7 ± 5.2% in these visits, it means that the first HST orbit from visit 1, 
that ref. 6 used as an out-of-transit reference, is already partially absorbed by the 
exospheric cloud. In fact, the flux measured during the first HST orbits of visits 2 
and 3 is significantly (>3σ) higher than the flux measured in the second orbits of 
these visits (by the value of 17.7 ± 5.2% quoted earlier), and in good agreement 
with the flux measured ~30 h after transit, several years before in visit 0 (ref. 17), as 
can be seen in Fig. 2. The absorbed flux measured in the second HST orbits of visits 
2 and 3 is a close match to the flux measured during the first HST orbit of visit 1. 
Using updated and more accurate transit ephemerides thus allows us to reconcile 
all HST observations within a common picture, quite different from the one 
painted by ref. 6. Not only does this substantially change the timing of the ultra- 
violet transit, but it also changes the amplitude of the measured absorption by a 
factor of ~2.5. This is critical for understanding the hydrogen atom dynamics and 
the exospheric cloud structure. 

X-ray observations and EUV flux of GJ 436. Magnetic activity could induce 
variations in the stellar emission. We use the unabsorbed red part of the line to 
estimate that the intrinsic stellar variability at GJ 436 should not exceed 5% to 11% 
of the Lyman-α flux during one visit. Our Chandra X-ray data cover pre-transit 
phases at four epochs, two of which coincide with HST visits 2 and 3, and we use them to 
monitor stellar activity. We detect a faint and soft X-ray source at the coordinates 
of GJ 436 in each of the four Chandra observations, with count rates of 2.0 ± 0.3, 
3.1 ± 0.4, 2.4 ± 0.4 and 2.6 ± 0.4 ks⁻¹ for the February, April, June 2013 and June 
2014 visits respectively. We extracted light curves (Extended Data Fig. 6) and 
spectra for each of these visits, as well as a combined spectrum for all four visits 
(Extended Data Fig. 7). 

We also reanalysed the XMM-Newton EPIC-pn spectrum of GJ 436 (ref. 28) 
(Extended Data Fig. 7) and found a higher X-ray flux and luminosity than previously 
reported. We find an acceptable fit to the XMM-Newton spectrum with a 
two-temperature thermal plasma model (APEC) with fitted temperatures of 0.09 
and 0.38 keV and abundances of 0.18 Solar (fixed to the value listed by ref. 28). We 
measure a flux of 4.6 × 10⁻¹⁴ erg s⁻¹ cm⁻² in the band previously used (0.12–2.48 keV 
or 0.5–10 nm), giving a luminosity in this band of 5.7 × 10²⁶ erg s⁻¹ (ref. 
28 values were 7.4 × 10⁻¹⁵ erg s⁻¹ cm⁻² and 9.1 × 10²⁵ erg s⁻¹, respectively). Our 
higher values for flux and luminosity are in much closer agreement with the 
measurement made from the ROSAT All-Sky Survey data for GJ 436 (ref. 27). 
We also inspected the ROSAT data and confirmed the marginal detection of the 
source with approximately eight photons, all below 0.3 keV. 

We fitted the combined spectrum from our four Chandra observations simultaneously 
with the XMM-Newton EPIC-pn spectrum (Extended Data Fig. 7). The 
Chandra and XMM-Newton spectra match well and we find an acceptable 
simultaneous fit with the two-temperature model described earlier. The temperatures 
and relative normalization of the two components were left free but forced to take 
the same values for the two data sets. We find temperatures of 0.097 keV and 0.72 
keV, or approximately 1 MK and 8 MK. Only the fluxes were allowed to vary 
between the XMM-Newton and Chandra spectra, and this combined fit can be 
seen in Extended Data Fig. 7 (XMM-Newton PN is black, Chandra ACIS-S is red). 
In the overlapping energy range (0.234–2.0 keV) X-ray fluxes are 1.84 × 10⁻¹⁴ erg 
s⁻¹ cm⁻² for XMM-Newton (December 2008) and 1.97 × 10⁻¹⁴ erg s⁻¹ cm⁻² for 
Chandra (averaged across the four observations in 2013 and 2014). 
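
The conversion between the fitted plasma temperatures in keV and in kelvin follows from T = kT / k_B. A minimal sketch (the function name is illustrative):

```python
BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV per kelvin

def kev_to_megakelvin(kt_kev):
    """Convert a plasma temperature expressed as kT (keV) to megakelvin."""
    return kt_kev * 1.0e3 / BOLTZMANN_EV_PER_K / 1.0e6

cool_component = kev_to_megakelvin(0.097)
hot_component = kev_to_megakelvin(0.72)
print(f"{cool_component:.2f} MK and {hot_component:.2f} MK")
```

This gives roughly 1.1 MK and 8.4 MK, consistent with the rounded values of 1 MK and 8 MK quoted above.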
We also fitted the four Chandra spectra individually, fixing the two temperatures 
to those derived from the combined fit to the XMM-Newton and Chandra 
data. This allowed us to determine the emission measures of the two temperature 
components as a function of time, which are plotted in Extended Data Fig. 8. We 
find that the higher temperature component, which dominates the spectrum 
above 0.5 keV, is more variable than the lower temperature component (dominat- 
ing below 0.5 keV), as shown by Extended Data Fig. 8. The variability of the higher 
temperature component accounts for most of the count rate variability in the 
Chandra observations and is presumably caused by stellar flares. The relatively 
stable lower temperature component is a useful proxy for the unobserved EUV 
emission that dominates the high-energy irradiation of the planet and probably 
drives the mass loss. The final two Chandra observations are contemporaneous 
with our HST visits 2 and 3, and while the emission measure of the high temperature 
component increases by a factor of 2.3 between these observations, the emission 
measure of the low temperature component increases by only 7%. This helps 
to explain the very similar ultraviolet absorption depths and implied mass loss 
rates seen in HST visits 2 and 3. It can also be seen in Extended Data Fig. 8 that the 
emission measure of the lower temperature component was much lower during 
the first two Chandra observations in February and April 2013. HST visits at these 
times might have observed lower mass loss rates from the planet. 

We use the X-ray flux measured in our third Chandra observation (contem- 
poraneous with HST visit 2) to estimate the EUV flux irradiating the planet. The 
fitted model implies a flux in the 0.12–2.48 keV band (0.5–10 nm) of 4.7 × 10⁻¹⁴ 
erg s⁻¹ cm⁻² (consistent with the flux at the time of the XMM-Newton observation). 
Converting to a flux at the surface of the star and comparing with the 
relation in figure 2 of ref. 33 leads to an estimate of the EUV flux at a reference 
separation of 1 au of 1.0 erg s⁻¹ cm⁻² (12.4–91.2 nm), which is a factor of 5 higher 
than the X-ray flux at 1 au of 0.21 erg s⁻¹ cm⁻² (0.5–10 nm). 
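
The scaling from the flux measured at Earth to the flux at 1 au is an inverse-square rescaling, sketched below. The distance to GJ 436 is not given in this section; the value of 10.2 pc used here is an assumption chosen for illustration, and the measured flux is taken as 4.7 × 10⁻¹⁴ erg s⁻¹ cm⁻².

```python
AU_PER_PARSEC = 206_264.806

def flux_at_1au(flux_at_earth, distance_pc):
    """Scale a flux observed at Earth to a reference separation of 1 au."""
    return flux_at_earth * (distance_pc * AU_PER_PARSEC) ** 2

# distance_pc=10.2 is an assumed value, not taken from this section.
xray_1au = flux_at_1au(4.7e-14, distance_pc=10.2)
euv_to_xray = 1.0 / xray_1au  # EUV estimate of 1.0 erg/s/cm^2 at 1 au, from the text
print(f"X-ray flux at 1 au: {xray_1au:.2f} erg/s/cm^2, EUV/X-ray ratio: {euv_to_xray:.1f}")
```

With this distance the X-ray flux at 1 au comes out near 0.21 erg s⁻¹ cm⁻², and the EUV estimate is about five times higher, as stated above.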

This X-ray-derived EUV flux compares favourably with an estimate based on 
the strength of the Lyman-α line. Taking the total reconstructed line flux of 3.5 × 
10⁻¹³ erg s⁻¹ cm⁻² (ref. 24) and using the model from table 6 of ref. 34 gives an 
estimated EUV line flux at 1 au of 1.06 erg s⁻¹ cm⁻² (10–91.2 nm). The consistency 
of the EUV flux estimates based on our X-ray and Lyman-α observations 
gives confidence that the high energy irradiation of the planet is well constrained 
despite being unobservable directly. 

Code availability. We have opted not to make the code used to produce the 
numerical simulation available. 
Extended Data Figure 1 | Time evolution of GJ 436 Lyman-α line during 
each HST visit. There is one spectrum per HST orbit. Colours indicate the 
phase with respect to the optical transit: out-of-transit (black), pre-transit 
(blue), in-transit (green) and post-transit (red and magenta for the last HST 
orbit in visit 1). For visit 1 there is no out-of-transit spectrum, hence we 
over-plotted the out-of-transit spectrum from visit 2 (dotted line). 
Extended Data Figure 2 | … reproduced with the numerical simulation. a–d, The panels show the stellar 
emission line at different transit phases, here for visit 2 (all error bars are 1σ). 
a, The out-of-transit reference line (black curve) is the calculated profile 
best-fitting the observed out-of-transit line profile (blue curve), after taking into 
account the instrumental line spread function. b–d, This theoretical profile is compared to 
observations (blue curves): the numerical simulation computes absorption in 
the theoretical profile and adjusts absorbed line profiles (dashed lines) as a 
function of time to the observations. 
Extended Data Figure 3 | Light curve from visit 1 data. a, b, The light curve 
was calculated from integration of the flux over the red part of the line in the 
velocity range [+20, +250] km s⁻¹. The temporal (a) and phase-folded (b) light 
curves are shown. The different colours represent data acquired during 
different consecutive HST orbits. The vertical dashed lines in a indicate the 
location of the optical transit contacts. The horizontal dashed lines in b show 
the best-fit constants to each HST orbit (orbit 1 data, the open circles, are not 
fitted because of their different phasing). We did not apply any correction to this 
visit. We also did not find it necessary to trim out the first HST orbit in this 
visit (or in the subsequent ones) due to possible increased systematics, as 
described previously. These time series have a higher sampling (by a factor 
of ~2) than those in Fig. 2, which is made possible by exploiting the time-tag 
mode of data acquisition. The 1σ error bars have been propagated accordingly. 
Extended Data Figure 4 | Correction steps for the visit 2 data. a–d, The 
different colours represent different consecutive HST orbits. The vertical 
dashed lines in the temporal light curves indicate the optical transit contact 
points. a, Raw light curve obtained from integration of the flux over the red part 
of the line in the velocity range [+20, +250] km s⁻¹. b, Breathing correction. 
This is the same light curve as in a, phase-folded on the HST orbit. The dashed 
colour lines are linear fits to the different HST orbit data. Data from HST orbit 1 
(empty black circles) are not taken into account because of their slightly 
different phasing. c, Light curve corrected for telescope breathing. A linear 
trend (dashed line) representing long-term systematics is fitted to the data. 
d, Light curve corrected for telescope breathing and long-term systematics. All 
error bars are 1σ. 
Extended Data Figure 5 | Correction steps for the visit 3 data. Identical to the description of Extended Data Fig. 4 for visit 2 data. All error bars are 1σ. 
Extended Data Figure 6 | Chandra X-ray counts of GJ 436. The different symbols show the different visits. The June 2013 (stars) and June 2014 (squares) 
observations were contemporaneous with HST visits 2 and 3. The vertical dashed lines represent the contacts of the optical transit. All error bars are 1σ. 
Extended Data Figure 7 | X-ray spectrum of GJ 436 fitted with a two-temperature 
model. a, b, The XMM-Newton EPIC-pn spectrum is shown in 
black, the combined Chandra ACIS-S spectrum is shown in red. a, Spectra in 
units of measured counts, with the Chandra spectrum below XMM-Newton 
because of the lower sensitivity of the instrument. b, Unfolded spectra in flux 
units; it can be seen that the XMM-Newton and Chandra spectra are consistent 
with each other. The dotted lines show the contributions of the individual 
temperature components. All error bars are 1σ. 
Extended Data Figure 8 | Emission measures for the two temperature 
components fitted to the individual XMM-Newton and Chandra spectra. 
The red points correspond to the low temperature component (0.097 keV) that 
dominates the spectra below 0.5 keV. The blue points correspond to the higher 
temperature component (0.72 keV) that dominates above 0.5 keV. The higher 
temperature component is more variable, probably due to the effect of stellar 
flares. The less variable lower temperature component is a proxy for the 
unobserved extreme-ultraviolet emission that dominates the high-energy 
irradiation of the planet. The last two points are contemporaneous with HST 
visits 2 and 3. 
Extended Data Table 1 | Log of the HST and Chandra observations 

Observation date   HST/STIS/G140M                                         Chandra/ACIS-S 
2010-01-05         GO#11817: 1 HST orbit (“visit 0”); ref. 17             - 
2012-12-07         GTO#12034: 1 transit (4 HST orbits, “visit 1”); ref. 6 - 
2013-02-16         GO#12965: delayed, no data                             Obs ID 14459: 1 transit (18.77 ks) 
2013-04-18         GO#12965: target pointing problem, no data             Obs ID 15537: 1 transit (19.75 ks) 
2013-06-18         GO#12965: 1 transit (4 HST orbits, “visit 2”)          Obs ID 15536: 1 transit (18.91 ks) 
2014-06-23         GO#12965: 1 transit (4 HST orbits, “visit 3”)          Obs ID 15642: 1 transit (18.77 ks) 
Extended Data Table 2 | Ephemerides of GJ 436b transit 

Mid-transit time T0 (JD)       Period P (days)      Reference 
BJD 2 456 295.431 924(45)      2.643 897 82(8)      Ref. 31 
BJD 2 454 865.084 034(35)      2.643 898 03(30)     Ref. 32 
BJD 2 454 865.083 208(42)      2.643 897 9(3)       Ref. 26 
HJD 2 454 415.620 74(8)        2.643 890 4          Ref. 30 
HJD 2 454 415.620 74(8)        2.643 850(90)        Ref. 6 

The offset between BJD and heliocentric Julian days (HJD) is small, typically ±4 s. 

*Values used in the present work. 
Linking high harmonics from gases and solids 


G. Vampa, T. J. Hammond, N. Thiré, B. E. Schmidt, F. Légaré, C. R. McDonald, T. Brabec & P. B. Corkum 


When intense light interacts with an atomic gas, recollision 
between an ionizing electron and its parent ion creates high-order 
harmonics of the fundamental laser frequency. This sub-cycle 
effect generates coherent soft X-rays and attosecond pulses, 
and provides a means to image molecular orbitals. Recently, high 
harmonics have been generated from bulk crystals, but what 
mechanism dominates the emission remains uncertain. To 
resolve this issue, we adapt measurement methods from gas-phase 
research to solid zinc oxide driven by mid-infrared laser fields 
of 0.25 volts per angstrom. We find that when we alter the generation 
process with a second-harmonic beam, the modified harmonic 
spectrum bears the signature of a generalized recollision between 
an electron and its associated hole. In addition, we find that solid-state 
high harmonics are perturbed by fields so weak that they are 
present in conventional electronic circuits, thus opening a route to 
integrate electronics with attosecond and high-harmonic technology. 
Future experiments will permit the band structure of a solid 
to be tomographically reconstructed. 

Fifty years ago, Keldysh introduced an important idea. Treating 
both gases and solids on the same footing, he showed that strong-field 
ionization (valence-band to conduction-band transitions in solids) 
can be approximated by tunnelling. Over the following decades, 
research on the interaction of intense pulses with solids and gases 
has diverged. Much of the solid-state research has been motivated 
by laser materials processing. A great deal of the gas-phase research 
has been motivated by high-harmonic generation, a process 
initiated by tunnelling and completed by the recollision of a highly 
energetic electron with its associated ion (hole). However, the deep 
similarity in their response to strong fields has been largely ignored. 
Now that high harmonics have been measured from ZnO (ref. 6) and 
other condensed media, it is important to reconsider the relation 
between the gas-phase and solid-state mechanisms. 

Individual ionizing atoms provide two sources of harmonics. 
Expressed in the language of solid-state studies, there is a ‘single-band’ 
(intraband) contribution, in which the recently tunnelled electron 
wave packet undergoes oscillatory motion in the laser field. Here the 
nonlinearity enters through the step-wise nature of tunnel ionization. 
In addition, there is a ‘two-band’ (interband) source. In atoms, this 
source is interpreted in terms of the ionized electron recolliding and 
recombining with its associated hole (parent ion), emitting a high-energy 
photon in the process. In gases, the former source dominates 
the low harmonics while the latter source dominates high-harmonic 
generation. Electrons and holes in solids exhibit similar mechanisms as 
they move in, and between, the conduction and valence bands. 
However, in a solid, the electron’s interaction with the lattice makes 
its oscillation nonlinear; this is a new source of intraband emission, 
and leads to a generalized recollision. The relative contribution of 
these intra- and interband mechanisms and their interplay is important 
theoretically and experimentally. The unresolved issue is under 
what circumstances one physical mechanism dominates the production 
of high harmonics in solids. 

To resolve this issue, we adapt a method of gas-phase research to
solids. Using a mid-infrared fundamental laser to generate odd
harmonics in ZnO, we perturb the process with a very weak second-
harmonic beam, thereby producing even harmonics. The strength of 
the even harmonics modulates as the second harmonic is delayed 
relative to the fundamental. The phase of the modulation for each 
harmonic order determines the spectral phase of the emitted harmonic 
beam and characterizes the generation mechanism. Using photons 
with energies well below that of resonance (hν ~ 0.1E_g, where E_g is
the bandgap of the solid), we find that a generalized recollision 
between the electron and its associated hole plays the dominant role 
in high-harmonic emission from ZnO at the field strength of our 
experiment, as it does in gases. By ‘generalized recollision’ we mean 
that between tunnelling and recollision the electron and hole move 
on their respective bands, as discussed in the Methods section. 
Furthermore, as we increase the relative second-harmonic intensity, 
for each intensity range we find solid-state behaviour that closely 
mirrors the atomic response. There is, however, one possible excep- 
tion. As described in the Methods section, in gases there are so-called 
long and short trajectory contributions, which lead to differently diver- 
ging radiation. Although we have searched for them, we have not 
resolved long-trajectory harmonics in our experiment. However, even 
this may be the same in gases since long-trajectory harmonics are hard 
to observe with infrared drivers. 

To understand the experiment it is useful to think of high-harmonic 
generation as a balanced interferometer. As illustrated in Fig. 1a for
atomic recollision, the electron (and hole) wave packets created on 
subsequent half-cycles of the driving field propagate on equal but 
oppositely directed trajectories (equal-length arms of an interfero- 
meter), creating sub-cycle bursts of short-wavelength radiation when 
they recollide. With each subsequent pulse having opposite phase 
because of the oppositely directed collision, when the pulses are spec- 
trally resolved, only odd harmonics constructively interfere. Adding a 
weak second-harmonic field unbalances the trajectories (interfero- 
meter), adding a small phase to one arm while removing it from the 
other, thereby producing even harmonics. For ZnO, because of the low 
reduced carrier mass near the band centre, low-photon-energy har- 
monics are very sensitive to weak second-harmonic fields. In a recolli- 
sion process, the phase that maximally unbalances the interferometer 
differs between harmonic orders. While illustrated for recollision, the 
interferometer analogy also applies to intraband oscillations: here, 
the asymmetry originates from the slightly different current generated 
in two successive half-cycles, as a result of the electron reaching 
higher (lower) momentum when it propagates in the left (right) side 
of the Brillouin zone. Because harmonics are emitted as the electron 
moves, they are all phased together’®. The second harmonic perturbs 
all of them equally and simultaneously, so the phase that maximally 
unbalances the interferometer is the same for all harmonics. This 
different behaviour allows us to discriminate between recollision and 
non-recollision based mechanisms. 
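
The interferometer picture can be illustrated with a toy calculation. The sketch below is not the two-band model used in this work: it assumes one delta-like burst per half-cycle with alternating sign, and it mimics the weak second-harmonic field by a hypothetical small phase `asymmetry` that is added to one arm and removed from the other. The function name and all parameters are illustrative assumptions.

```python
import numpy as np

def harmonic_spectrum(asymmetry=0.0, cycles=8, samples_per_cycle=256):
    """Toy spectrum of a burst train emitted every half laser cycle.

    `asymmetry` (radians) is a hypothetical phase added to the bursts of
    one half-cycle and subtracted from those of the other half-cycle.
    Returns a dict mapping harmonic order -> spectral intensity.
    """
    n = cycles * samples_per_cycle
    signal = np.zeros(n, dtype=complex)
    for half in range(2 * cycles):
        index = half * samples_per_cycle // 2      # burst position
        sign = -1.0 if half % 2 else 1.0           # oppositely directed collisions
        phase = -asymmetry if half % 2 else asymmetry
        signal[index] = sign * np.exp(1j * phase)
    spectrum = np.abs(np.fft.fft(signal)) ** 2
    # The record holds `cycles` laser periods, so harmonic h sits in bin h * cycles.
    return {order: spectrum[order * cycles] for order in range(1, 12)}

balanced = harmonic_spectrum(0.0)
unbalanced = harmonic_spectrum(0.05)
print("balanced   I2/I1:", balanced[2] / balanced[1])
print("unbalanced I2/I1:", unbalanced[2] / unbalanced[1])
```

In this toy model the even orders vanish for a balanced interferometer, and a small asymmetry δ produces an even-to-odd intensity ratio of tan²δ. The order dependence of the phase that distinguishes recollision from intraband emission is not captured here.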

Experimentally, high harmonics are produced by focusing the 
fundamental and the second harmonic of mid-infrared laser pulses 
(3.76 µm central wavelength) in a 500-nm ZnO single crystal at an
intensity of 0.85 TW cm⁻² (0.25 V Å⁻¹) in the crystal. The two colours

Figure 1 | Realization of the in situ experiment. a, Pictorial representation of
the electron’s trajectories. In the fundamental field alone, the electron travels 
symmetrically to the left and to the right of the ion (central grey sphere) during 
two successive half-cycles (red dashed lines). The addition of the second 
harmonic lengthens one trajectory while shortening the other (solid blue lines), 
thereby breaking the symmetry and producing even harmonics. b, Measured 
high-harmonic spectra versus the delay between the two colours; the spectral 
intensity is colour-coded (key at right). The even harmonic intensity modulates 
every half-cycle. The odd harmonics also modulate, but the modulation is 
much weaker. The phase of the even harmonic modulation depends on the 
harmonic order, as evidenced by the red solid line relative to the black reference 
line. (The red line links the minima in the modulation of the even harmonic 
intensity.) Each harmonic order is normalized separately. The relative 
intensity between even-order and adjacent odd-order harmonics is given at the 
top (I_{2N}/I_{2N−1}). The delay of the second harmonic is defined in cycles of the
fundamental. 


(that is, fundamental and second harmonic) have parallel polarizations 
(details of the experimental set-up are given in the Methods section). 
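
The intensity and field strength quoted above for the crystal can be cross-checked with the standard relation between cycle-averaged intensity and peak field, I = ε₀cE²/2. The sketch below assumes the vacuum form of this relation (refractive index of 1), which is an assumption on our part; the function name is illustrative.

```python
import math

EPSILON_0 = 8.8541878128e-12   # vacuum permittivity, F/m
SPEED_OF_LIGHT = 299_792_458.0 # m/s

def peak_field_v_per_angstrom(intensity_tw_per_cm2):
    """Peak electric field (V/Å) for a cycle-averaged intensity in TW/cm²."""
    intensity_si = intensity_tw_per_cm2 * 1e12 * 1e4          # W/m²
    field_si = math.sqrt(2.0 * intensity_si / (EPSILON_0 * SPEED_OF_LIGHT))
    return field_si * 1e-10                                   # V/m -> V/Å

print(round(peak_field_v_per_angstrom(0.85), 3))
```

For 0.85 TW cm⁻² this gives a peak field close to the 0.25 V Å⁻¹ stated in the text.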

Figure 1b shows the harmonic spectrum versus the relative phase 
between the two colours, for a second-harmonic intensity 9 107° 
times that of the fundamental. Each harmonic order is independently 
normalized; the relative intensity between adjacent orders (I_{2N}/I_{2N−1})
is reported at the top of the graph (N is an integer that identifies the 
harmonic order). The weak even-harmonic signal modulates with an
order-dependent phase Φ_osc(2N) as a function of delay. Similar behaviour
is measured for atomic harmonics, where Φ_osc is used to extract
the spectral phase of each even harmonic or, equivalently, the emission
time of the attosecond pulses. Here, we use Φ_osc to establish the origin
of solid-state harmonics.

Figure 2 shows Φ_osc plotted as a function of harmonic order (black
circles) up to the cut-off of our spectrograph. We use an unmeasured
constant phase to position the experimental data (this constant phase is
measurable using nonlinear optical methods). The theoretical values of
Φ_osc calculated from a two-band model are also plotted for the intraband
(purple line) and interband (blue line) sources, and for their
combined emission (gold dashed line); the calculations are described 
in the Methods section. We find that the experiment is consistent with 
the theoretical prediction (Extended Data Fig. 6) that interband har- 
monics dominate in ZnO for the field strength used (we note that recent 
work has found evidence for the dominance of intraband oscillations 
for THz excitation’). In the Methods section we investigate theoretically 
how our results scale with the field strength of the fundamental. 

The slope of the curves in Fig. 2 is determined by both the intensity 
of the fundamental and the band structure of the crystal. Crystal dis- 
persion has two major consequences: first, the electron-hole pair accu- 
mulates more phase than in a gas (~ 10 times for the short trajectories), 
making it easier to break the symmetry with the second-harmonic 

Figure 2 | Comparison with theory. The phase of the modulation of the
even harmonics is extracted from Fig. 1 (black circles) and compared to
the simulated intraband (purple line) and interband (blue line) phase. The
simulated phase for their combined emission (intraband plus interband,
yellow dashed line) agrees with the interband emission, in agreement with
interband harmonics being significantly stronger than intraband ones
(Extended Data Fig. 6).


field. This is why we require second-harmonic intensities of only 
~10 > times that of the fundamental. Second, crystal dispersion influ- 
ences the duration of the high-harmonic pulses. We measure a harmonic
chirp of 0.38 fs eV⁻¹ at the 16th harmonic, corresponding to a
train of pulses with duration of 1.7 fs (before dispersion compensation) 
for a ~2 eV bandwidth. Wider bands and higher intensities will 
result in smaller chirps, thereby allowing attosecond pulse generation 
from solids. 

As the second-harmonic intensity is increased, we move from per- 
turbing to controlling harmonic emission. In gases, we first reach an 
intensity where Φ_osc is nearly out of phase for the even and odd
harmonics. Figure 3a shows experimental results for ZnO using a
second-harmonic intensity of ~10°* times that of the fundamental. 
The alternation between even and odd harmonics occurs when the 
second harmonic modulates the electron phase by ®,,. = 1/2. For 
even higher second-harmonic field strengths (Fig. 3b), high-harmonic 
emission is substantially modified. In the highly asymmetric sum field, 
tunnelling only occurs once per laser cycle. This leads to simultaneous 
emission (or suppression) of even and odd harmonics—a behaviour 
well-studied for atomic gases.

Taken together, our findings provide strong evidence that, for the 
field strength and laser wavelength of our experiment, a generalized 
recollision between an electron and its associated hole is the primary 
source of high harmonics in ZnO. Thus, 50 years after Keldysh’s 
seminal paper’, we rebuild the connection linking the high-field res- 
ponse of gases and solids. But decoherence is a major difference 
between solids and gases. The occurrence of electron-hole recombina- 
tion demonstrates the tolerance of the short-lived electron to dephas- 
ing. Although scattering times vary considerably between materials**” 
and with excitation conditions”, our measurements suggest that high- 
harmonic technology could have a broad impact in condensed media. 

In this paper we have exploited our ability to manipulate the phase 
that the electron-hole pair acquires to identify the fundamental mech- 
anism dominating harmonic generation in ZnO. The phase also 
encodes information about the band structure of the material in which 
the electron travelled. By measuring Φ_osc as a function of crystal orientation,
it will be possible to reconstruct a material’s three-dimensional
momentum-dependent band structure. This all-optical technique will
allow measurement of band structures where it is not possible to detect 
photoelectrons, for example in high-pressure experiments”. 

Although we have not concentrated on it here, the tunnelling step 
determines the harmonic intensity. Our weak second-harmonic beam 

Figure 3 | From perturbing to controlling harmonic generation. a, When

the intensity of the second harmonic reaches 1 X 10 “ times that of the 


fundamental, the even and odd harmonics modulate out of phase. b, Further 
increasing the intensity to 3 X 10” ° times that of the fundamental, we find that 


not only modifies the electron phase but also slightly alters tunnelling, 
thereby leading to oscillations of the odd harmonics with the delay 
between the two colours. The relative phase of the modulation between 
even and odd orders contains detailed information on the time- 
dependent tunnelling rate*®. Applying this analysis to solids will allow 
us to probe the sub-cycle dynamics of strong-field tunnelling in 
solids””**—the essence of laser material modification”. 

Finally, independent of the mechanism, we have demonstrated that a 
clear spectral signature can be imposed on a high-harmonic beam using
a control beam having a peak electric field of ~5 V µm⁻¹. Fields of this
magnitude are present in electronic circuits. Thus, we have bridged
the gap between electronics and attosecond physics. If the sensitivity 
that we have demonstrated to perturbing fields can be transferred to 
silicon, it will be possible to record movies of working semiconductor 
electrical circuits or plasmons propagating in nano-plasmonic devices. 
Harmonics generated from a spatially complex circuit will diffract 
according to the instantaneous distribution of the internal fields. The 
control field can also be applied by electrodes on the crystal, thereby 
allowing manipulation of the harmonic beam with DC or pulsed elec- 
trical signals. Thus we have taken a significant step towards a new area 
of research, that of solid-state attosecond electronics”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


Experimental set-up. High harmonics are generated with a mid-infrared laser 
source centred around 3.76 µm and with 19 µJ pulse energy. The pulse duration of
95 fs is measured with a dispersion-free SHG FROG after the ZnO crystal. 
The mid-infrared wavelength is obtained by difference-frequency generation in 
a 400 µm thick AgGaS₂ (AGS) crystal between the signal and the idler of a
commercial Optical Parametric Amplifier (Light Conversion Topas-HE) pumped 
by 4.5 mJ from a Ti-sapphire laser. The output of the OPA is further amplified in a 
beta barium borate (BBO) crystal pumped by 10 mJ of 800 nm pulses. The mid- 
infrared beam is spatially filtered with a pin-hole and focused with an f/30 spher- 
ical Ag mirror onto an epitaxially grown 500 nm thin film of a single crystal of 
wurtzite ZnO(0001) deposited on a 0.5 mm sapphire(0001) substrate. The optical 
axes of both crystals are aligned parallel to the laser k-vector. We used the knife- 
edge technique to measure the size of the focal spot. 

The second harmonic is generated in a 300 µm AGS crystal optimized for type-I
SHG right after the pin-hole to exploit the high intensity and high beam quality. 
The second-harmonic intensity is controlled by moving the AGS crystal closer to 
or farther from the pin-hole. The two beams are separated and then recombined 
before the focusing mirror with dichroic beam splitters (LaserOptick) that reflect 
the second harmonic. Its polarization is rotated with a broadband λ/2 plate before
recombination. The delay between the two colours is scanned with a PZT stage on 
the second-harmonic arm. 

The high harmonics are refocused with an Al mirror into a visible-ultraviolet
spectrometer from OceanOptics.

The theoretical model. To interpret the experiment, we use the theoretical frame- 
work developed in ref. 11. The nonlinear response of the solid is described by a two 
band model whose characteristics are chosen to be those of ZnO. The model solves 
the semiconductor Bloch equations for the time-dependent band populations
N_m(k, t) (m = v, c for the valence and conduction bands, respectively) and for the
interband polarization p(k, t), where k is the crystal momentum, for a single
electron-hole pair. Coulomb interaction between different pairs, leading to coupling
of different k-states, is neglected. The band parameters and lattice constants
are reported in ref. 32. The laser parameters are: frequency ω = 0.0121 a.u., peak
field strength F_0 = 0.0049 a.u.
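
As a consistency check, the laser parameters in atomic units can be converted to the laboratory units used in the main text. The sketch below uses standard conversion factors; the 3.3 eV bandgap is the value quoted for ZnO in the Extended Data captions, and the function name is an illustrative assumption.

```python
HARTREE_EV = 27.211386                  # 1 a.u. of energy in eV
FIELD_AU_V_PER_ANGSTROM = 51.422067     # 1 a.u. of electric field in V/Å
HC_EV_NM = 1239.841984                  # h*c in eV nm

def describe_driver(omega_au, field_au, bandgap_ev=3.3):
    """Convert frequency and peak field from atomic units to lab units."""
    photon_ev = omega_au * HARTREE_EV
    return {
        "photon_energy_eV": photon_ev,
        "wavelength_um": HC_EV_NM / photon_ev / 1000.0,
        "field_V_per_A": field_au * FIELD_AU_V_PER_ANGSTROM,
        "photon_over_gap": photon_ev / bandgap_ev,
    }

for name, value in describe_driver(0.0121, 0.0049).items():
    print(f"{name}: {value:.4f}")
```

With ω = 0.0121 a.u. and F_0 = 0.0049 a.u. this reproduces a wavelength near 3.76 µm, a peak field near 0.25 V Å⁻¹ and a photon energy of about one tenth of the bandgap.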

Extended Data Fig. 1 shows the population of the conduction band along the 
Γ-M direction of the reciprocal space as a function of time. The laser field creates
electron-hole pairs around each field extreme. They are then accelerated to high 
crystal momenta and back to the centre of the Brillouin zone in a process that 
repeats every half laser cycle. 

The model naturally includes the sub-cycle exchange of the band population— 
the analogue of the step-wise ionization in gases. However, in solids both the 
single-band (intraband) and the two-band (interband) mechanism bear the sig- 
nature of the generalized electron-hole recollision. 

Extended Data Fig. 2a, b shows the high-harmonic spectrum of the respective 
intraband and interband terms as a function of time. The spectra are obtained 
from a windowed Fourier transform of the intra- and interband currents, defined 
in ref. 11, which can be separately calculated by the model. A 0.35 cycles wide 
Blackman window is scanned across half a laser cycle. The resulting spectra are 
continuous because only one recollision event is allowed inside the temporal 
window. We apply a super-Gaussian spectral filter that does not affect above 
bandgap harmonics, but progressively filters lower harmonics. This is done 
because the temporal window is too narrow for the lower orders (and the fun- 
damental frequency) and results in artificial broadening of these harmonics which 
ultimately masks the weak high-harmonic signal if the filter is not applied. 
Interband harmonics above the bandgap are approximately 10⁴ times stronger
than intraband ones. For each harmonic photon energy, the maximum spectral 
intensity determines its time-of-emission (or spectral phase). 
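
The time-frequency analysis described above can be sketched as follows. This is a minimal illustration, assuming a sampled current on a uniform time grid measured in laser cycles; the function name, the zero-padding factor and the synthetic test burst are our own assumptions and are not taken from ref. 11.

```python
import numpy as np

def emission_time(current, samples_per_cycle, order, centres, width_cycles=0.35):
    """Window centre (in laser cycles) at which harmonic `order` is strongest.

    A Blackman window of `width_cycles` is scanned over `centres`, and the
    windowed Fourier transform is evaluated at the requested harmonic order.
    """
    width = int(round(width_cycles * samples_per_cycle))
    window = np.blackman(width)
    n_fft = 8 * samples_per_cycle                        # zero-padded transform
    orders = np.fft.rfftfreq(n_fft, d=1.0 / samples_per_cycle)
    column = int(np.argmin(np.abs(orders - order)))
    intensity = []
    for centre in centres:
        start = int(round(centre * samples_per_cycle)) - width // 2
        segment = current[start:start + width] * window
        intensity.append(np.abs(np.fft.rfft(segment, n=n_fft)[column]) ** 2)
    return centres[int(np.argmax(intensity))]

# Synthetic example: a burst of 15th-harmonic light centred at 0.6 cycles.
samples_per_cycle = 512
t = np.arange(2 * samples_per_cycle) / samples_per_cycle
burst = np.exp(-((t - 0.6) / 0.05) ** 2) * np.cos(2 * np.pi * 15 * t)
centres = np.linspace(0.3, 0.9, 61)
print(emission_time(burst, samples_per_cycle, 15, centres))
```

Repeating the call for each harmonic order gives an emission time per order, which is the quantity compared with the recollision time. The super-Gaussian spectral filter mentioned in the text is omitted from this sketch.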

For comparison, the time of generalized recollision is traced in red in Extended 
Data Fig. 2a, b. The time of re-encounter of the electron with its associated hole is 
extracted from their classical motion in the conduction (for the electron) and 
valence (for the hole) bands relative to the time of the field crest in which they 
were created at zero crystal momentum by strong-field tunnelling. Their acceleration
in reciprocal space is k(t) = A(t) − A(t′), where A(t) is the laser vector
potential, and t′ is the time of creation of the electron-hole pair. The real-space
velocity is v_m[k] = ∇_k E_m(k), with E_m(k) the energy dispersion for band m, from
which the instantaneous position is obtained by temporal integration. This semi-classical
model, introduced in ref. 11, has been detailed in ref. 32.
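
The semi-classical recipe above can be sketched in one dimension. The code is a simplified illustration under stated assumptions: the field is F_0 cos(ωt) with A(t) = −(F_0/ω) sin(ωt) in atomic units, the electron-hole separation is integrated from the gradient of a model bandgap, and both dispersion functions (including the `width` and `lattice` values of the cosine band) are hypothetical stand-ins rather than the ZnO parameters of ref. 32.

```python
import numpy as np

def parabolic_velocity(k, reduced_mass=1.0):
    """Relative electron-hole velocity for a parabolic gap (atom-like case)."""
    return k / reduced_mass

def cosine_band_velocity(k, width=0.1, lattice=6.0):
    """Hypothetical non-parabolic gap: gap(k) = gap_min + width*(1 - cos(k*lattice))."""
    return width * lattice * np.sin(k * lattice)

def recollision_phase(birth_phase, velocity=parabolic_velocity,
                      omega=0.0121, field=0.0049, steps=20001):
    """Laser phase (rad) of the first electron-hole re-encounter, or None.

    Phase 0 is the crest of the field. The pair is created at `birth_phase`
    with zero crystal momentum and followed for one laser cycle.
    """
    phases = np.linspace(birth_phase, birth_phase + 2.0 * np.pi, steps)
    vector_potential = -(field / omega) * np.sin(phases)
    k = vector_potential - vector_potential[0]           # k(t) = A(t) - A(t')
    v = velocity(k)
    dt = (phases[1] - phases[0]) / omega
    # Cumulative trapezoidal integration of the relative velocity.
    x = np.concatenate(([0.0], np.cumsum(0.5 * (v[1:] + v[:-1]) * dt)))
    crossings = np.nonzero(x[1:-1] * x[2:] < 0.0)[0]
    if crossings.size == 0:
        return None
    i = crossings[0] + 1
    fraction = x[i] / (x[i] - x[i + 1])                   # linear interpolation
    return phases[i] + fraction * (phases[i + 1] - phases[i])

print(recollision_phase(0.3))     # born after the crest, parabolic gap
print(recollision_phase(-0.3))    # born before the crest, parabolic gap
print(recollision_phase(0.3, velocity=cosine_band_velocity))
```

For the parabolic gap, a pair born shortly after the crest recollides later in the cycle while a pair born before the crest does not return, as in the atomic case. Swapping in a non-parabolic dispersion changes the return times, which is the effect the generalized recollision picture accounts for.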

Extended Data Fig. 2 shows that both interband and intraband currents produce 
high harmonics with a spectral phase determined by the generalized recollision. 
This recollision contribution is contained in the structure of the intraband equation.
Contrary to interband harmonics, intraband emission is dominated by
nonlinear oscillations and by tunnelling below the 18th harmonic order. The
transition is reflected in the in situ calculation (see Fig. 2), where Φ_osc becomes
independent of the delay between fundamental and second harmonic below the
18th harmonic.

Many electron-hole pairs are created by the strong laser field. Owing to the 
interaction of one electron (hole) with the others, the electron (hole) can scatter to 
a different k-state than that of its correlated hole (electron), therefore preventing 
their recombination. However, in a many-body theory, each carrier can encounter 
a different oppositely charged partner. In atomic high harmonic generation it was 
argued that recombination of the electron with an uncorrelated hole is not allowed 
in principle**. This debate is yet to be settled in solids. Our experiment only detects 
recollision between correlated electron-hole pairs. 

Modelling the experiment. A weak second-harmonic field is added to the fun- 
damental one to simulate the in situ experiment. The high-harmonic spectrum is 
recorded for each phase between the two colours. The resulting spectrograms for 
the interband and intraband emission are reported in Extended Data Fig. 3a, b, 
respectively. We require the second harmonic to be 6 X 10 * relative to the 
fundamental to produce even high harmonics that are ~5% of the odd ones. 
The modulation depth is close to 100% for both intra- and interband generated 
above bandgap harmonics. It drops to ~50% for below bandgap interband har- 
monics and for harmonics below the 18th for intraband emission. The spectro- 
grams of the interband and intraband currents and of the sum of the two are 
analysed in the same way as the experimental data: (i) the intensity of each
harmonic is spectrally integrated over its width; (ii) the resulting intensity modulation
as a function of delay is offset to 0 and normalized to 1; and (iii) it is fitted to a
cosine function of delay with form cos(delay + Φ_osc). Finally, Φ_osc for both
emission mechanisms and for the total emission (the sum of the two) is extracted
and shown in Fig. 2 of the main text.
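
Steps (ii) and (iii) can be sketched as a linear least-squares fit. This is an illustrative implementation, not the analysis code used for the paper: it assumes the delay has already been converted to the phase variable of the cosine, and the function name is hypothetical.

```python
import numpy as np

def modulation_phase(delay_phase, intensity):
    """Phase of the modulation for one harmonic order.

    `delay_phase` is the two-colour delay expressed as a phase (rad) and
    `intensity` the spectrally integrated harmonic signal at each delay.
    The signal is offset to 0, normalized to 1, and fitted to
    a*cos(d) + b*sin(d) + c, which is equivalent to R*cos(d + phase) + c.
    """
    d = np.asarray(delay_phase, dtype=float)
    y = np.asarray(intensity, dtype=float)
    y = y - y.min()                      # offset to 0
    y = y / y.max()                      # normalize to 1
    design = np.column_stack([np.cos(d), np.sin(d), np.ones_like(d)])
    a, b, _ = np.linalg.lstsq(design, y, rcond=None)[0]
    # R*cos(d + phase) = R*cos(phase)*cos(d) - R*sin(phase)*sin(d)
    return np.arctan2(-b, a)

delays = np.linspace(0.0, 4.0 * np.pi, 200)
signal = 3.0 + 2.0 * np.cos(delays + 0.7)
print(modulation_phase(delays, signal))
```

Writing the cosine as a sum of cos and sin terms makes the fit linear in its unknowns, so no initial guess for the phase is required.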

Comparison with atomic high-harmonic generation. We analyse the similar- 
ities between the high harmonics from atoms and solids based on calculation of 
classical trajectories. In Extended Data Fig. 4 we plot Φ_osc calculated for an isolated
atom (green line). The field strength of the fundamental is increased to 0.44 V Å⁻¹
to allow the same cut-off as in ZnO. The phase for interband emission in ZnO and 
the experimental data are the blue line and the black circles, respectively. 
According to the analytical equivalence between atomic and solid emission, 
demonstrated in refs 11, 32, the intensity modulation of the even harmonics can 
be calculated starting from the additional dipole phase:

$$\sigma(t,\phi) = \int_{t'(t)}^{t} v\big(k(\tau,t')\big)\, A_2(\tau,\phi)\, \mathrm{d}\tau$$

where $A_2(t,\phi) = A_2\cos(2\omega t + \phi)$ is the vector potential of the second harmonic
and $v(k) = \nabla_k \varepsilon_g(k)$, with $\varepsilon_g(k)$ the momentum-dependent bandgap. Once the
semi-classical trajectories are known (as discussed in the previous section),
the integral can be computed and the phase of the modulation extracted from
$\Phi_{\mathrm{osc}} = \operatorname{atan}(\sigma_s/\sigma_c)$ with $\sigma(t,\phi) = \sigma_c(t)\cos(\phi) + \sigma_s(t)\sin(\phi)$.
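
The decomposition into cosine and sine parts follows from the angle-addition formula. The short derivation below is our reading of the definitions above, assuming the dipole phase is the integral of the velocity times the second-harmonic vector potential along the trajectory; the amplitude symbol $R$ is introduced here for illustration only.

```latex
\begin{align}
A_2\cos(2\omega\tau+\phi) &= A_2\left[\cos(2\omega\tau)\cos\phi - \sin(2\omega\tau)\sin\phi\right],\\
\sigma_c(t) &= A_2\int_{t'}^{t} v\,\cos(2\omega\tau)\,\mathrm{d}\tau,\qquad
\sigma_s(t) = -A_2\int_{t'}^{t} v\,\sin(2\omega\tau)\,\mathrm{d}\tau,\\
\sigma(t,\phi) &= \sigma_c\cos\phi + \sigma_s\sin\phi
 = R\cos\!\left(\phi-\Phi_{\mathrm{osc}}\right),\qquad
R=\sqrt{\sigma_c^{2}+\sigma_s^{2}},\quad
\tan\Phi_{\mathrm{osc}}=\frac{\sigma_s}{\sigma_c}.
\end{align}
```

The last line shows why the ratio of the two integrals fixes the delay at which the perturbation of a given harmonic is largest.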

In atomic high-harmonic generation there are two important contributions to a 
given high harmonic. One is from the so-called short trajectory electrons. They 
appear as the curve with negative slope at larger phases in Extended Data Fig. 4. 
Almost all research on gas-phase harmonics has concentrated on this contri- 
bution. There are also long trajectory electrons. They are responsible for the other 
part of the curve and are often ignored in gases since their divergence is larger and 
an experiment can be designed to minimize their impact. 

In ZnO, the long trajectory contribution is significantly different from the
atomic case. This is a direct consequence of the non-parabolic band dispersion.
The distortion of the trajectories is extensively analysed in ref. 32. Further, no 
prediction can be made for below-bandgap harmonics, since classically the elec- 
tron cannot recollide with energy smaller than the minimum bandgap. A quantum 
mechanical calculation that includes the effect of the tunnelling step is required to 
extend the comparison to smaller photon energies. 
Theoretical cut-off scaling of interband emission. The high-harmonic spectrum 
extends up to a defined maximum harmonic order, called the cut-off. In atoms, the 
cut-off is related to the maximum kinetic energy available to the electron when it 
recombines with the hole. The cut-off depends linearly on the ponderomotive 
energy U_p, which, in turn, depends quadratically on the field strength.
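
The quadratic dependence can be made concrete with the textbook expressions for atoms, which are not derived in this paper: in atomic units the ponderomotive energy of a linearly polarized field is U_p = F_0²/(4ω²), and the semi-classical cut-off is approximately I_p + 3.17U_p. The sketch below uses these standard formulas; the function names are illustrative.

```python
HARTREE_EV = 27.211386  # 1 a.u. of energy in eV

def ponderomotive_energy_au(field_au, omega_au):
    """U_p = F^2 / (4 * omega^2) in atomic units (linear polarization)."""
    return field_au ** 2 / (4.0 * omega_au ** 2)

def atomic_cutoff_ev(field_au, omega_au, ionization_potential_ev):
    """Textbook atomic cut-off, I_p + 3.17 * U_p, in eV."""
    u_p_ev = ponderomotive_energy_au(field_au, omega_au) * HARTREE_EV
    return ionization_potential_ev + 3.17 * u_p_ev

u_p = ponderomotive_energy_au(0.0049, 0.0121)
print(f"U_p = {u_p:.4f} a.u. = {u_p * HARTREE_EV:.2f} eV")
print(ponderomotive_energy_au(0.0098, 0.0121) / u_p)   # doubling the field
```

Doubling the field strength quadruples U_p, which is the atomic scaling that the linear cut-off scaling in solids is contrasted with.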

In contrast with atomic harmonics, experiments in solids® show a linear scaling 
of the cut-off with field strength. Theoretical investigation of the intraband current 
confirmed the linear scaling*. Extended Data Fig. 5 shows that the simulated 
interband current also leads to a cut-off that scales linearly with the field strength. 
Exploiting the analytical solution for this current®, it is possible to derive a semi- 
classical approximate cut-off scaling law that fully agrees with the simulation. This 
analysis is reported in ref. 32. 

Role of interband emission for various field strengths. We theoretically invest- 
igate the relative importance of interband and intraband emission at different field 
strengths at the laser wavelength used in this experiment. Extended Data Fig. 6 
reports the interband and intraband spectra for F_0 = 0.003 a.u., F_0 = 0.0049 a.u.
(corresponding to the field strength reached in our experiment) and F_0 = 0.008 a.u.
(corresponding to the field strength reached in ref. 6, including the loss from 
normal incidence reflection). In all cases, interband emission dominates by four 
orders of magnitude for harmonic orders above the minimum bandgap (marked by 
the vertical dashed black line). Below the minimum bandgap the difference is less. 

A less clear harmonic structure is observed at the highest field strength. The loss 
of contrast arises from the interference between two sets of recolliding trajectories: 
one of electrons born slightly after the peak of the field (the ‘short’ branch of the 
atomic case) with one of electrons born just before the peak of the field. In 
the atomic case, electrons born before the peak do not recollide. In a solid, the
non-parabolic shape of the band structure allows these extra recombinations. The
topic will be the subject of a future publication.

Extended Data Figure 1 | Evolution of conduction band population with
time as a function of Bloch vector. Population in the conduction band
(colour-coded, key at right) is created at each peak of the laser field and
subsequently accelerated to large crystal momenta. The population is resolved
along the Γ-M direction of the Brillouin zone.

Extended Data Figure 2 | High-harmonic emission is timed to the electron-hole
recollision. The spectral content of the intraband (a) and interband
(b) mechanisms as a function of time shows that each high harmonic is emitted
at a specific moment of the laser cycle. This moment is the time of generalized
recollision between the electron and the hole (red line). The intensity
(colour-coded) is on a logarithmic scale. The laser parameters are:
ω = 0.0121 a.u., F_0 = 0.0049 a.u.

Extended Data Figure 3 | Predicted in situ spectrograms. Simulated spectrogram for interband (a) and intraband (b) emission. Intensity is colour-coded on a
logarithmic scale. Each harmonic order has been normalized to simplify comparison. The laser parameters are: @ = 0.0121 a.u., Fo = 0.0049 a.u., Foo, = 10? Fo. 

Extended Data Figure 4 | Comparison of atomic and solid high-harmonic
generation. The phase of the modulation of the even harmonic intensity is 
plotted as a function of harmonic order. The green line is obtained for an atom 
with ionization potential equal to the bandgap of ZnO (3.3 eV, marked by 
the vertical black dashed line). The blue line and the black circles are the 
theoretical prediction for the interband source and the experimental data 
points, respectively. The theoretical prediction is based on classical trajectory
calculation.

Extended Data Figure 5 | Cut-off scaling of interband emission. The high-
harmonic cut-off of interband emission predicted by the numerical solution of
the two-band model is plotted as a function of field strength. The cut-off scales
linearly with the field strength up to the maximum bandgap (reached at the
edge of the Brillouin zone). The calculated values (red dots) are connected
by the dash-dotted line to show the linear scaling.

Extended Data Figure 6 | Scaling of high-harmonic emission with the field
strength of the fundamental. The two-band model predicts that interband
emission dominates over a wide range of field strengths. The field strength of
our experiment is F_0 = 0.0049 a.u. (middle panel), while that of ref. 6 corrected
for the reflection loss is F_0 = 0.008 a.u. (bottom panel). The vertical black
dashed line marks the minimum bandgap (at 3.3 eV).

Contribution of changes in atmospheric circulation
patterns to extreme temperature trends 


Daniel E. Horton, Nathaniel C. Johnson, Deepti Singh, Daniel L. Swain, Bala Rajaratnam & Noah S. Diffenbaugh


Surface weather conditions are closely governed by the large-scale 
circulation of the Earth’s atmosphere. Recent increases in the
occurrence of some extreme weather phenomena have led to
multiple mechanistic hypotheses linking changes in atmospheric
circulation to increasing probability of extreme events.
However, observed evidence of long-term change in atmospheric
circulation remains inconclusive. Here we identify statistically
significant trends in the occurrence of atmospheric circulation
patterns, which partially explain observed trends in surface temperature
extremes over seven mid-latitude regions of the Northern
Hemisphere. Using self-organizing map cluster analysis, we
detect robust circulation pattern trends in a subset of these regions 
during both the satellite observation era (1979-2013) and the 
recent period of rapid Arctic sea-ice decline (1990-2013). 
Particularly substantial influences include the contribution of 
increasing trends in anticyclonic circulations to summer and 
autumn hot extremes over portions of Eurasia and North 
America, and the contribution of increasing trends in northerly 
flow to winter cold extremes over central Asia. Our results indicate 
that although a substantial portion of the observed change in 
extreme temperature occurrence has resulted from regional- and 
global-scale thermodynamic changes, the risk of extreme tempera- 
tures over some regions has also been altered by recent changes 
in the frequency, persistence and maximum duration of regional 
circulation patterns. 

Although most land regions show robust warming over the past 
century”, the pattern of change has not been spatially uniform”. 
This heterogeneity results from regional differences in the response 
of the climate system to increasing radiative forcing, and from the 
background noise of climate variability. Together, these factors 
substantially increase the challenge of climate change detection, attri- 
bution and projection at regional and local scales'*""*. 

The spatial pattern of changes in extreme weather events has gen- 
erated arguments that global warming has caused dynamic and/or 
thermodynamic changes that have differentially altered extreme event 


probabilities. Thermodynamic arguments are well understood and
observed. For example, the accumulation of heat in the atmosphere has 
resulted in upward trends in hot extremes, downward trends in the 
majority of cold extremes, and more intense hydroclimatic events’. 
Dynamic arguments have greater uncertainties'*”. Changes in the 
large-scale atmospheric circulation—for instance, an increase in the 
occurrence or persistence of high-amplitude wave patterns—could alter 
the likelihood of extreme events”. Recent extremes in the Northern 
Hemisphere mid-latitudes'*’” have motivated hypotheses of a dynamic 
linkage between ‘Arctic amplification’, altered atmospheric circulation
patterns, and changes in the probability of mid-latitude extremes**””. 
Despite divergent views on the causal direction of this linkage’, altered 
atmospheric dynamics are consistently invoked. Although trends in 
mean-seasonal mid-atmospheric geopotential heights have been iden- 
tified (fig. 2.36 of ref. 21; Fig. 1), evidence of changes in the occurrence of 
sub-seasonal atmospheric patterns remains equivocal, as does their 
contribution to extreme event probabilities®*. 

Previous efforts to detect trends in atmospheric circulation may
have been hampered by narrowly defined, spatially sensitive, and/or
non-standardized metrics. We therefore employ a large-scale
spatial characterization approach, self-organizing map (SOM)
cluster analysis, to track the occurrence of highly generalized
mid-atmospheric circulation patterns. We use 500 hPa geopotential height
anomaly fields to describe daily circulation, and group each day’s
pattern into one of a predefined number of SOM clusters based on a
measure of pattern similarity (Methods). The number of clusters
is largely dependent on the degree of specificity/generality required to
test a particular hypothesis. To facilitate generalized large-scale
mid-atmospheric classification, we use four clusters per domain. Using three
reanalyses (Methods), we calculate linear trends (yr⁻¹) in the time
series of annual values of (1) the total number of days in each season
on which each SOM pattern occurs (‘occurrence’ (d yr⁻¹)); (2) the
mean length of consecutive occurrence (‘persistence’ (d event⁻¹));
and (3) the longest consecutive occurrence (‘maximum duration’
(d event⁻¹)). We consider trends in each metric to be robust if
matching circulation patterns from all three reanalyses have statistically
significant trends of the same sign. We assess the robustness of trends
for seven mid-latitude regions (Figs 2 and 3a), over both the era of
satellite observation (1979-2013; ‘satellite era’) and the era of rapidly
diminishing Arctic sea ice (1990-2013; ‘ice era’). We report
circulation patterns that pass these robustness criteria, but also discuss results
in the context of (1) comprehensive multiple hypothesis testing; (2)
removal of the assumption of linear time-series relationships; (3) use of
fewer/more clusters; and (4) addition of atmospheric thermal dilation
controls (Methods; Extended Data Table 1 and Extended Data Figs 1-3).

Figure 1 | Trends in mid-atmospheric geopotential heights. Northern
Hemisphere polar projections of 1979-2013 seasonal trends (m yr⁻¹) in
500 hPa geopotential heights. Trends are computed for winter
(a; December, January, February (DJF)), spring (b; March, April, May
(MAM)), summer (c; June, July, August (JJA)) and autumn (d; September,
October, November (SON)) seasons. Geopotential height fields are
sourced from NCEP-DOE-R2.

Figure 2 | Trends in surface temperature extremes and atmospheric
circulation patterns. Trends are calculated for each Northern Hemisphere
season (December, January, February (DJF), winter; March, April, May
(MAM), spring; June, July, August (JJA), summer; September, October,
November (SON), autumn) for two periods: 1979-2013 (satellite era) and
1990-2013 (ice era). Regional domains (see Fig. 3a) in which one or more of
the four SOM circulation patterns demonstrate robust trends in
mid-atmospheric circulation pattern occurrence (O), persistence (P), or maximum
duration (M) are shown in green (Extended Data Figs 1 and 2). Positive (+)
and negative (−) symbols are displayed when all three reanalyses show
statistically significant trends in a particular circulation pattern (5%
significance level; Methods), and agree on the sign of those trends. Multiple
symbols within a box indicate multiple robust pattern trends. White boxes
without symbols indicate no statistically significant trends and/or reanalysis
disagreement (see Methods). Regional domains with positive and/or negative
trends in cold (05) and/or hot (95) extremes receive (+) or (−) symbols
when the three reanalyses agree on the sign of the area-weighted trend. Red and
blue boxes indicate that the extreme temperature trend results in warming
and cooling, respectively, while grey boxes indicate reanalysis disagreement.

Of the 112 total circulation patterns analysed in each period
(Methods), the three reanalyses exhibit statistically significant trends
in pattern occurrence for a total of 17, 16 and 16 patterns in the satellite
era, and 15, 13 and 14 patterns in the ice era (Extended Data Table 1a).
Of these significant occurrence trends, 12 satellite-era and 10 ice-era
patterns are robustly significant across all three reanalyses (Fig. 2). The
majority of robust satellite-era trends occur in summer and autumn,
while robust ice-era trends are more evenly distributed over summer,
autumn and winter. These patterns are diverse, and include
anticyclonic, cyclonic and ‘dipole’ circulations (Extended Data Figs 4 and 5).
Patterns with robust trends in both satellite and ice eras are limited to
summer and autumn over western Asia and eastern North America.
While the number of significant trends in pattern persistence varies
from 5 to 10 across the individual reanalyses (Extended Data Table 1a),
only three robust pattern persistence trends are identified in each
period (Fig. 2). Robust maximum duration trends are more prevalent,
including five in the satellite era and six in the ice era. These are
predominantly associated with summer anticyclonic patterns,
although the maximum duration of central Asia winter troughing
events demonstrates a robust ice-era increase (Extended Data Figs 4
and 5). In regions with robust trends in multiple patterns, those
patterns are generally complementary. For example, in summer over
eastern North America, robustly increasing satellite-era trends in
anticyclonic patterns co-occur with robustly decreasing trends in cyclonic
patterns.
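The three metrics defined above (occurrence, persistence and maximum duration) can be illustrated with a short sketch. It assumes that each day of a season has already been assigned an integer SOM pattern label, and that consecutive-day runs are counted within a single season; the function name `pattern_metrics` and the example sequence are hypothetical, not taken from the study.

```python
from itertools import groupby


def pattern_metrics(labels, pattern):
    """Return (occurrence, persistence, maximum duration) for one season.

    labels  : sequence of daily SOM pattern labels for a single season
    pattern : label of the pattern of interest
    """
    # Length of each unbroken run ("event") of the chosen pattern.
    runs = [sum(1 for _ in group)
            for key, group in groupby(labels) if key == pattern]
    if not runs:
        return 0, 0.0, 0
    occurrence = sum(runs)                 # total days, d yr^-1
    persistence = occurrence / len(runs)   # mean run length, d event^-1
    max_duration = max(runs)               # longest run, d event^-1
    return occurrence, persistence, max_duration


# Hypothetical 12-day "season" of daily pattern labels.
season = [1, 1, 2, 2, 2, 1, 3, 3, 1, 1, 1, 1]
print(pattern_metrics(season, 1))
```

Repeating this for every year gives one annual time series per metric, to which a trend can then be fitted.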

We next explore the extent to which trends in mid-atmospheric 
circulation patterns have influenced the likelihood of temperature 
extremes. For each period, we compute area-weighted trends in the 
seasonal occurrence of temperature extremes for all days, and for those 
days associated with each SOM pattern (for example, Fig. 3a, j-m; see 
Methods). The three reanalyses generally agree on the direction of 
all-days trends: consistent with enhanced radiative forcing and global 
warming, most regions and seasons show positive trends in hot 
occurrence, and negative trends in cold occurrence (Fig. 2). 

Hot extremes are projected to increase due to the dynamic and
thermodynamic effects of global warming. Consistent with other
assessments, we find substantial increases in extreme heat occurrence
over the mid-latitudes (Extended Data Figs 6 and 7). For instance, the
regional-mean occurrence of summer hot days over Europe, western
Asia and eastern North America has increased 0.10, 0.16 and 0.13 d
yr⁻¹ yr⁻¹, respectively, over the satellite era (Fig. 3a and Extended Data
Table 2a-c). By definition, one would expect (on average) ~4.5
5th/95th percentile events per 3-month season, meaning that an increase
of 0.10 d yr⁻¹ yr⁻¹ accumulated over the course of the satellite era
(35 years) yields an additional ~3.5 d yr⁻¹, an ~75% increase.
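The arithmetic behind these figures can be checked directly. The sketch below assumes a nominal 90-day season (an assumption; summer has 92 days), so the baseline is 5% of 90 days; the variable names are illustrative only.

```python
# Worked arithmetic for the Europe summer hot-extreme example in the text.
SEASON_DAYS = 90        # assumed nominal length of a 3-month season
TAIL_FRACTION = 0.05    # days beyond the 5th or 95th percentile
TREND = 0.10            # d yr^-1 yr^-1, value quoted in the text
PERIOD_YEARS = 35       # satellite era, 1979-2013

baseline = SEASON_DAYS * TAIL_FRACTION   # expected extreme days per season
added = TREND * PERIOD_YEARS             # change accumulated over the era
relative_increase = added / baseline

print(f"baseline ~{baseline:.1f} d/yr, added ~{added:.1f} d/yr, "
      f"relative increase ~{relative_increase:.0%}")
```

With these rounded inputs the ratio comes out slightly above three quarters, which is in line with the ~75% quoted in the text.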

Heatwaves, similar to those which occurred in western Russia in 
2010 and Europe in 2003, develop when persistent anticyclonic 
patterns, often referred to as ‘atmospheric blocking’, initiate a cascade 
of self-reinforcing, heat-accumulating physical processes. In
addition to the increasing trends in extreme heat occurrence, robust
positive trends in the occurrence, persistence and maximum duration 
of satellite-era summer mid-atmospheric anticyclonic patterns are 
detected over Europe (Fig. 3c, g), western Asia (Fig. 4a, e), and eastern 
North America (Extended Data Fig. 4c). Robust positive trends in the 
occurrence of satellite-era anticyclonic patterns are also detected— 
along with increasing hot extremes—in autumn over eastern North 
America (Fig. 4c, g), eastern Asia (Fig. 4d, h) and central North 
America, and in spring over Europe (Extended Data Fig. 1). 

Increases in hot extremes may result from dynamic changes 
(namely greater occurrence and persistence of anticyclonic patterns) 
as well as from thermodynamic changes such as land cover change or 
global warming (reflected in the increased intensity of extreme 
temperature when anticyclonic patterns occur). Over Europe, the
summer occurrence of circulations similar to dipole patterns with
ridging over the eastern half of the domain (Fig. 3c) increased 0.45 d
yr⁻¹ yr⁻¹ over the satellite era, while the persistence and maximum
duration increased 0.05 and 0.19 d event⁻¹ yr⁻¹, respectively (Fig. 3g).
The trend in the frequency of hot events coincident with this pattern
(0.06 d yr⁻¹ yr⁻¹; Fig. 3k and Extended Data Table 2a) accounts for
~61.5% of the total trend in hot extremes over Europe (0.10 d yr⁻¹ yr⁻¹;
Fig. 3a). In addition, the number of hot extremes per pattern
occurrence has increased for all four patterns (Fig. 3n-q and Extended Data
Table 2a). Under the assumption of pattern stationarity, we perform a
quantitative partitioning of the dynamic and thermodynamic
contributions to the overall extreme temperature trend, as well as to the
extreme temperature trends associated with each pattern (Methods).
This partitioning reveals that ~27.3% of the 0.10 d yr⁻¹ yr⁻¹ overall
increasing trend in hot extremes is driven by the dynamic influence of
increased occurrence of the dipole pattern (along with ~35.2%
thermodynamic and ~−1.0% interaction influences). Additionally, of the
0.06 d yr⁻¹ yr⁻¹ portion of the hot extremes trend associated with this
pattern, ~57.3% is attributable to thermodynamic influences and
~44.3% to increased pattern occurrence (Extended Data Table 2a).
Together, these results suggest that the observed increase in extreme
summer heat over Europe is attributable to both increasing frequency
of blocking circulations and changes in the surface energy balance.

Figure 3 | Trends in circulation patterns and hot extremes over Europe.
a, 1979-2013 trends in summer hot extreme occurrences for all regional
domains based on 2-m maximum/minimum temperatures from
NCEP-DOE-R2. b-e, SOM-derived mid-atmospheric circulation patterns (500 hPa
geopotential height anomalies) over Europe. White boxed values show
pattern frequencies in the top left and SOM node numbers in the top right.
f-i, Time series of SOM circulation pattern occurrence (black (d yr⁻¹)),
persistence (blue (d event⁻¹)) and maximum duration (red (d event⁻¹)). The
slope of the trend line (yr⁻¹) and P values (in parentheses) are colour coded,
with the values from 1979-2013 (solid trend line) displayed above those from
1990-2013 (dashed trend line). j-m, Spatially rendered trends in hot extreme
occurrences for days that correspond to each SOM circulation pattern.
n-q, Time series of the area-weighted mean of hot extremes per pattern
occurrence, referred to throughout the text as a measure of the intensity of
temperature extremes associated with each pattern. Statistically significant
trends (5% significance level; Methods) are shown by stippling in the
mapped panels and by bold font in the scatter plots.

Similar results are found in other regions that exhibit robust upward 
trends in anticyclonic patterns (Fig. 4 and Extended Data Table 2). 
Global warming is also generally expected to decrease the frequency
of cold extremes. In autumn over eastern Asia, the occurrence of
satellite-era cold extremes decreased 0.08 d yr⁻¹ yr⁻¹ (Extended
Data Fig. 6g and Extended Data Table 2f), indicating a reduction of
~60% over the 35-year period. A majority of this decreasing trend
(0.05 d yr⁻¹ yr⁻¹; Extended Data Table 2f) is attributable to changes
associated with one pattern type: cyclonic circulations capable of
advecting cold air equator-ward (Fig. 4d). Less frequent occurrence
of cyclonic patterns (Fig. 4h), in conjunction with less intense cold
temperature anomalies when cyclonic patterns occur (Fig. 4p), drives
~62.5% of the overall decreasing trend (partitioned ~35.4% dynamic,
~21.9% thermodynamic, ~5.3% interaction). Of the 0.05 d yr⁻¹ yr⁻¹
decrease in extreme cold associated with the trend in cyclonic patterns,
partitioning indicates ~56.5% dynamic and ~35.0% thermodynamic
influences (Extended Data Table 2f).

Figure 4 | Circulation pattern and thermal extreme trends for selected
regions. Trends in thermal extreme occurrences for selected regions and
seasons based on 2-m maximum/minimum temperatures from
NCEP-DOE-R2. a-d, SOM-derived mid-atmospheric circulation patterns (500 hPa
geopotential height anomalies) over western Asia in summer (a), central Asia in
winter (b), eastern North America in autumn (c), and eastern Asia in autumn
(d). White boxed values show pattern frequencies in the top left and SOM node
numbers in the top right. In contrast to Fig. 3, just one of the four SOM
circulation patterns is displayed from each region. e-h, Time series of SOM
circulation pattern occurrence (black (d yr⁻¹)), persistence (blue (d event⁻¹))
and maximum duration (red (d event⁻¹)). The slope of the trend line (yr⁻¹)
and P values (in parentheses) are colour coded, with the values from 1979 to
2013 (solid trend line) displayed above those from 1990 to 2013 (dashed trend
line). i-l, Spatially rendered trends in thermal extreme occurrences for days that
correspond to each SOM circulation pattern. m-p, Time series of the
area-weighted mean of temperature extremes per pattern occurrence, referred to
throughout the text as a measure of the intensity of temperature extremes
associated with each pattern. Statistically significant trends (5% significance
level; Methods) are shown by stippling in the mapped panels and by bold font in
the scatter plots. Refer to Extended Data Figs 6 and 7 for satellite-era and ice-era
trends in temperature extremes over the regional domains.


In contrast to this expected extreme cold decrease, winter cold
extremes over central Asia have increased 0.07 d yr⁻¹ yr⁻¹ over the
ice era (Extended Data Fig. 7a). One-hundred-and-forty-nine per cent
of this trend (0.10 d yr⁻¹ yr⁻¹; Fig. 4j) occurred when
mid-atmospheric circulation was similar to a pattern of troughing in the south
and east, and ridging in the northwest (Fig. 4b; ~111.4% dynamic,
~25.6% thermodynamic, ~11.6% interaction). Trend percentages
exceeding 100% indicate that other circulation patterns provide
negative contributions. Occurrence and persistence of this dipole pattern
robustly increased (1.0 d yr⁻¹ yr⁻¹ and 0.12 d event⁻¹ yr⁻¹, Fig. 4f) at
the expense of all other circulations (Extended Data Table 2d).


©2015 Macmillan Publishers Limited. All rights reserved 


Partitioning indicates that ~75.0% of the extreme cold trend
associated with this pattern is due to the dynamic influence of increased
pattern occurrence, with ~17.2% linked to thermodynamic influences
(Extended Data Table 2d).

Substantial dynamic contributions to the overall trend in cold
extremes could be expected given that circulations that support the
equator-ward advection of Arctic air will bring anomalously cold
temperatures to lower-latitude locales. Increased occurrence of such
patterns has previously been observed, and linked to reduced regional
sea-ice and decreased baroclinicity over the Barents and Kara
seas. Positive thermodynamic contributions to the extreme cold
trend indicate processes that are in opposition to the direct warming
effects of enhanced radiative forcing. For example, positive
thermodynamic contributions from three of the four winter patterns over
central Asia (Extended Data Table 2d) suggest that these
thermodynamic contributions are largely independent of atmospheric
circulation, and therefore potentially related to surface processes such as
increased snow cover and enhanced diabatic cooling.

The circulation trends detected here cannot as yet be attributed to
anthropogenic or natural causes, nor can they be projected to continue
into the future. Attribution and projection will require an increased
understanding of the causes of the circulation trends, including the
ability to identify the signal of an anthropogenically forced trend from
the noise of internal decadal-scale climate variability. However, our
quantitative partitioning, in conjunction with targeted climate model
simulations, offers the potential to fingerprint dynamic and
thermodynamic climate influences in isolation, which in turn may
facilitate attribution of the observed trends, and projection of future trends.
We hypothesize that the main assumption of our quantitative
partitioning (pattern stationarity) is justified given the expectation that
circulation responses to enhanced radiative forcing are likely to
reinforce pre-existing modes of natural variability. A related
assumption is that the reanalyses act as reasonable proxies for the state of the
three-dimensional atmosphere through time. Given uncertainties in
the data assimilation and numerical modelling that underpin
atmospheric reanalysis, we have restricted our identification criteria to those
trends that are statistically significant in all three reanalyses.

Our approach finds robust trends in mid-atmospheric circulation 
patterns over some regions, and suggests that both dynamic and ther- 
modynamic effects have contributed to observed changes in temper- 
ature extremes over the past 35 years. Although thermodynamic 
influences have largely dominated these changes, dynamic influences 
have been critical in some regions and seasons. Long-term projections 
of future dynamic contributions are challenging given the substantial 
underlying decadal-scale variability, as well as the uncertain impact of 
anthropogenic forcing on mid-latitude circulation. However, given
our finding that many patterns have exhibited increasing (decreasing) 
intensity of extreme hot (cold) events, and that those trends are coin- 
cident with a nearly categorical increase in thermodynamic forcing, 
the observed trends of increasing hot extremes and decreasing cold 
extremes could be expected to continue in the coming decades, should 
greenhouse gases continue to accumulate in the atmosphere. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Categorization of circulation patterns. We use SOM cluster analysis to
categorize large-scale circulation patterns over seven Northern Hemisphere
domains using daily 500 hPa geopotential height anomaly fields from the
NCAR/NCEP-R1 (ref. 32), NCEP-DOE-R2 (ref. 33) and ECMWF ERA-Interim
(ref. 34) reanalyses. Daily anomalies are calculated by subtracting the seasonal
cycle (calendar-day mean) from each grid cell. Reanalyses are analysed
individually to maintain their physical consistency, and to facilitate their intercomparison.
The SOM’s unsupervised learning algorithm requires neither a priori knowledge
of which types of circulation patterns might be detected, nor the specific
geographic regions in which they might occur. Geopotential height anomaly fields
from each day are assigned to one of a pre-defined number of nodes, according to
pattern similarity. The final SOM patterns are obtained by minimizing the
Euclidean distance between iteratively updated nodes and their matching daily
geopotential height anomaly fields. Each SOM pattern can therefore be viewed as
a representative composite of relatively similar circulation patterns.
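As an illustration of the clustering step described above, here is a minimal sketch of a one-dimensional, four-node SOM trained with the classic online Kohonen update. The study does not state which SOM software, topology, learning rate or neighbourhood settings were used, so every parameter below (`n_iter`, `lr0`, `sigma0`), the function names, and the synthetic three-value "fields" are assumptions for demonstration only.

```python
import math
import random


def best_match(nodes, x):
    """Index of the node with the smallest squared Euclidean distance to x."""
    dists = [sum((a - b) ** 2 for a, b in zip(node, x)) for node in nodes]
    return dists.index(min(dists))


def train_som(fields, n_nodes=4, n_iter=2000, lr0=0.5, sigma0=1.0, seed=0):
    """Train a tiny 1-D SOM on flattened daily anomaly fields."""
    rng = random.Random(seed)
    # Initialise each node from a randomly chosen day.
    nodes = [list(f) for f in rng.sample(fields, n_nodes)]
    for t in range(n_iter):
        frac = t / n_iter
        lr = lr0 * (1.0 - frac)                    # decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 1e-3)   # shrinking neighbourhood
        x = rng.choice(fields)
        best = best_match(nodes, x)
        for j, node in enumerate(nodes):
            # Gaussian neighbourhood weight on the 1-D node index.
            h = math.exp(-((j - best) ** 2) / (2.0 * sigma ** 2))
            for k in range(len(node)):
                node[k] += lr * h * (x[k] - node[k])
    return nodes


# Synthetic stand-in for flattened 500 hPa height anomalies (3 grid cells).
rng = random.Random(1)
centres = [(-50.0, 0.0, 50.0), (50.0, 0.0, -50.0),
           (0.0, 60.0, 0.0), (0.0, -60.0, 0.0)]
fields = [[c + rng.gauss(0.0, 5.0) for c in centres[d % 4]] for d in range(200)]

nodes = train_som(fields)
labels = [best_match(nodes, f) for f in fields]  # one pattern label per day
```

The resulting `labels` sequence is the daily classification from which pattern occurrence statistics are derived.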

Owing to global-scale warming, trends in geopotential height anomalies record 
both altered atmospheric circulation patterns and the thermal expansion of the 
troposphere. To isolate the signal of circulation pattern change, previous clustering 
analyses have assumed uniform thermal dilation and removed either the domain 
average or domain average linear trend from the daily-scale anomalies. In our
analysis, we find that 1979-2013 trends in Northern Hemisphere geopotential 
height anomalies are non-uniform in both magnitude and sign (Fig. 1), and 
demonstrate substantial seasonal, regional and latitudinal differences (Extended 
Data Fig. 3e). These findings suggest that for the relatively short period of our 
analysis, an assumption of uniform thermal dilation is inappropriate. Moreover, 
the strong spatial heterogeneity indicates the importance of large-scale dynamics 
in the regional geopotential height trends, and so the removal of local geopotential 
height trends would conflate dynamic changes with thermal dilation. Therefore, in 
the main text we present results and conclusions based on raw geopotential height 
data. However, despite the lack of uniform expansion, we have conducted an 
analysis that attempts to account for the effects of thermal dilation. SOM analyses 
are performed on geopotential heights that have been detrended by removing the 
seasonal mean hemispheric trend (Extended Data Fig. 3e) from each grid cell. 
Results from this analysis indicate that the magnitude, significance and sign of 
circulation trends are sensitive to the method of controlling for thermal expansion 
(Extended Data Fig. 3f-j). Despite this sensitivity, the conclusions presented in the 
main text are supported, in that both raw and detrended analyses generally suggest 
trends of similar magnitude, sign and significance. 

Based on domain-wide pattern correlations between daily height field
anomalies and different SOM node counts (Extended Data Fig. 8), we divide
circulation patterns over each domain into four SOM nodes (for example, Fig. 3b-e). To
determine a suitable number of nodes, a suite of different node counts were 
analysed. We found that four nodes were sufficiently great in number to capture 
a diversity of highly generalized circulation patterns, but sufficiently few to
facilitate convenient presentation and, critically, to prevent overly similar SOM
patterns. To test the sensitivity of our results to the number of nodes, we present 2-,
4-, 8- and 16-node SOMs for the summer season over the European domain 
(Extended Data Figs 1 and 2). Based on these analyses, it is apparent that a 2-node 
SOM is insufficient to capture the diversity of circulation patterns that are found in 
the reanalysis data (Extended Data Fig. 1a), whereas 8- and 16-node SOMs pro- 
duce nodes with overly similar circulation patterns (Extended Data Figs 1c and 2). 
Examination of the 4-, 8- and 16-node SOMs largely verifies the conclusions 
drawn from the 4-node SOM: that the occurrence of patterns with ridging over 
the eastern half of the European domain has increased over time, while the
occurrence of complementary patterns has decreased. We note that these pattern trends
are not identified in the 2-node SOM, confirming that two nodes are too few to 
capture specific circulation patterns that are critical for extreme temperature 
occurrence. Similar node-count analyses for other regions/seasons likewise verify 
the conclusions drawn from the 4-node analyses that are presented in the main 
text (not shown). 

Calculation of robust trends in circulation patterns. For each season in each
year, we calculate (1) the total number of days on which each SOM pattern occurs
(occurrence (d yr⁻¹)); (2) the mean length of consecutive occurrence (persistence
(d event⁻¹)); and (3) the longest consecutive occurrence (maximum duration
(d event⁻¹)). A trend in one of these characteristics is considered robust when the
trend in that pattern is (1) statistically significant in all three reanalyses, and (2) of
the same sign in all three reanalyses. Trends with regression coefficients that
surpass the 5% significance (95% confidence) threshold are considered statistically
significant. Trends are calculated across satellite-era and ice-era annual time series
using the approach of ref. 37, which allows us to account for temporal dependence.
Here, the trends are calculated using linear least squares regression, but to account
for temporal dependence, the confidence bounds of annual time-series trends with
lag-one autocorrelation greater than the 5% significance level are recalculated by
adjusting the number of degrees of freedom used to compute the regression
coefficient significance. As a result, unlike a simple linear regression, this
approach does not rely on the independent and identically distributed assumption
for the residuals, but instead accounts for temporal dependence using a red noise
assumption.
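One common way to make such an adjustment is sketched below. It is not necessarily the exact procedure of ref. 37: the effective sample size n_eff = n(1 − r1)/(1 + r1), the large-sample threshold 1.96/√n for deciding whether the lag-one autocorrelation r1 is significant, and the function name `trend_with_red_noise` are all assumptions made for illustration.

```python
import math


def trend_with_red_noise(y):
    """OLS trend of an annual series with a lag-1 autocorrelation adjustment.

    Returns (slope, t_statistic, effective_sample_size).
    """
    n = len(y)
    x = list(range(n))
    xm, ym = sum(x) / n, sum(y) / n
    sxx = sum((xi - xm) ** 2 for xi in x)
    slope = sum((xi - xm) * (yi - ym) for xi, yi in zip(x, y)) / sxx
    resid = [yi - (ym + slope * (xi - xm)) for xi, yi in zip(x, y)]

    # Lag-1 autocorrelation of the regression residuals.
    ss = sum(e * e for e in resid)
    r1 = sum(a * b for a, b in zip(resid[:-1], resid[1:])) / ss if ss > 0 else 0.0

    # Assumed test: adjust only if r1 exceeds a large-sample 5% bound.
    n_eff = float(n)
    if r1 > 1.96 / math.sqrt(n):
        n_eff = n * (1.0 - r1) / (1.0 + r1)

    # Slope standard error computed with the effective degrees of freedom.
    dof = max(n_eff - 2.0, 1.0)
    se = math.sqrt(ss / dof / sxx) if ss > 0 else 0.0
    t_stat = slope / se if se > 0 else float("nan")
    return slope, t_stat, n_eff


# Hypothetical 35-year series with a trend and slowly varying (red) noise.
series = [0.1 * i + math.sin(i / 3.0) for i in range(35)]
slope, t_stat, n_eff = trend_with_red_noise(series)
print(f"slope={slope:.3f}, t={t_stat:.2f}, n_eff={n_eff:.1f}")
```

The t statistic would then be compared with a Student's t critical value for n_eff − 2 degrees of freedom; a smaller n_eff widens the confidence bounds, which is the intended effect of the red noise assumption.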

The approach of ref. 37 also assumes that the distribution of the residuals is
Gaussian. Using the Anderson-Darling test for normality, we find that 91-100%
of residual distributions in each metric of each reanalysis do not reject the null
hypothesis of Gaussianity when multiple hypothesis testing controlling the
familywise error rate (FWER) at the 5% significance level is considered (Extended Data
Table 1b). The Gaussianity assumption is therefore largely appropriate. However,
due to the identification of non-normality in some distributions, particularly in the
persistence and maximum duration metrics, we apply Box-Cox power
transformations to all distributions. Using the Anderson-Darling test, we find that
96-100% of the distributions of the residuals in the transformed setting do not reject
the null hypothesis of Gaussianity at the 5% significance level. Furthermore, when
multiple hypothesis testing is considered by controlling the FWER at the 5% level,
100% of the individual tests are non-significant (Extended Data Table 1b). In
addition, the number of pattern trends identified as significant in the transformed
case is largely consistent with the non-transformed regression analysis. However,
additional significant trends in the persistence and maximum duration metrics are
identified when Box-Cox transformations are used (Extended Data Table 1a). The
large overlap of results between the transformed and non-transformed analyses
suggests that although individual residual distributions may vary, the Gaussian
assumption applied throughout this study is for the most part quite robust,
although in some cases not valid. In sum, the non-transformed analysis that fits
a linear relationship to circulation pattern metrics allows for a relatively simple
classification of two short analysis periods, while simultaneously accounting
for temporal dependence in a large number (>2,000) of individual time series
(7 regions × 4 nodes × 4 seasons × 3 characteristics × 2 time periods × 3
reanalyses).
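For reference, the Box-Cox power transformation for a fixed exponent, and the count of time series quoted above, can be written out as follows. This is only a sketch: the text does not say how the exponent (here `lam`) was selected, and the function name `box_cox` is hypothetical.

```python
import math


def box_cox(values, lam):
    """Box-Cox power transform of strictly positive values for a given lambda."""
    if any(v <= 0 for v in values):
        raise ValueError("Box-Cox requires strictly positive data")
    if lam == 0:
        return [math.log(v) for v in values]          # limiting case
    return [(v ** lam - 1.0) / lam for v in values]


# regions x nodes x seasons x characteristics x time periods x reanalyses
n_series = 7 * 4 * 4 * 3 * 2 * 3
print(n_series)
```

The product confirms that the number of individual time series exceeds 2,000, as stated.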

Because SOM nodes are calculated independently for each reanalysis, individual
SOM patterns must be matched between the three reanalyses in order to determine
whether an individual pattern shows robust results across all three reanalyses. To 
determine which SOM patterns are the closest match between the three reanalyses, 
the root mean square error (RMSE) is calculated between the SOM patterns of one 
reanalysis and those of the other reanalyses. Patterns with the smallest RMSEs are 
considered matches. Although we undertake a multi-reanalysis robustness evalu- 
ation, further work is needed to confirm that other available reanalyses (such as 
CFSR and MERRA) show the same trends. 
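The matching step can be sketched as a nearest-pattern search by RMSE. This minimal version assumes that all patterns are flattened onto a common grid, and it does not enforce a one-to-one assignment (the text does not say whether one was enforced); the names `rmse` and `match_patterns` and the toy two-cell patterns are hypothetical.

```python
import math


def rmse(a, b):
    """Root mean square error between two flattened patterns."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


def match_patterns(reference, other):
    """For each reference pattern, index of the lowest-RMSE pattern in `other`."""
    return [min(range(len(other)), key=lambda j: rmse(ref, other[j]))
            for ref in reference]


# Toy SOM patterns from two hypothetical reanalyses (two grid cells each).
reanalysis_a = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
reanalysis_b = [[0.1, 0.9], [-0.9, 0.0], [0.9, 0.1], [0.0, -1.1]]
matches = match_patterns(reanalysis_a, reanalysis_b)
print(matches)
```

Each entry of `matches` gives the node in the second reanalysis that corresponds to a node in the first, so that trends can be compared pattern by pattern.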
Multiple hypothesis testing of linear trends. It is possible that some of the trends
identified as significant in any individual reanalysis could occur by chance. In
addition to screening for those patterns that are significant in all three reanalyses
(our ‘robustness’ criterion), we also employ formal multi-hypothesis testing using
three methodologies. The first is the familywise error rate (FWER). This type of
error metric controls the probability of falsely rejecting any null hypothesis, and is
considered one of the strictest forms of error control. Since a certain number of
false rejections can happen by chance alone, one can account for this formally by
using a second metric, the k-familywise error rate (k-FWER). The k-FWER controls
the probability of falsely rejecting k or more null hypotheses, and aims to formalize the
concept that some of the hypotheses will be rejected by chance. One option for the
value of k is to use the expected number of hypotheses that will be rejected at a
given significance level. For instance, in our study, out of 112 total ‘local’
hypotheses, 5 or 6 hypotheses will be significant at the 5% significance level by chance
(112 × 0.05 = 5.6). In this case, one can evaluate the probability that 7 or more
hypotheses are falsely rejected, since on average about 6 could be rejected as
significant by chance. The third metric is the false discovery rate (FDR), which
controls the expectation of the ratio given by the number of false rejections divided
by the total number of rejections.
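To make the three error-control ideas concrete, the sketch below applies one standard procedure for each. The text does not name the specific procedures used, so the choices here (Bonferroni for the FWER, the single-step generalized Bonferroni threshold kα/n for the k-FWER, and the Benjamini-Hochberg step-up rule for the FDR) are assumptions, and the P values are invented for illustration.

```python
def bonferroni_fwer(pvals, alpha=0.05):
    """FWER control: reject when p <= alpha / n (Bonferroni)."""
    n = len(pvals)
    return [p <= alpha / n for p in pvals]


def k_fwer(pvals, k, alpha=0.05):
    """k-FWER control: reject when p <= k * alpha / n (generalized Bonferroni)."""
    n = len(pvals)
    return [p <= k * alpha / n for p in pvals]


def benjamini_hochberg(pvals, alpha=0.05):
    """FDR control via the Benjamini-Hochberg step-up procedure."""
    n = len(pvals)
    order = sorted(range(n), key=lambda i: pvals[i])
    # Largest 1-based rank r whose ordered P value satisfies p <= r * alpha / n.
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= rank * alpha / n:
            cutoff = rank
    reject = [False] * n
    for i in order[:cutoff]:
        reject[i] = True
    return reject


n_tests = 7 * 4 * 4                 # 112 local hypotheses per period
expected_false = n_tests * 0.05     # about 5.6 rejections expected by chance
# Hypothetical P values: four small ones and 108 clearly non-significant ones.
pvals = [0.0001, 0.0004, 0.003, 0.02] + [0.5] * (n_tests - 4)

n_fwer = sum(bonferroni_fwer(pvals))
n_kfwer = sum(k_fwer(pvals, k=7))
n_fdr = sum(benjamini_hochberg(pvals))
print(n_fwer, n_kfwer, n_fdr)
```

In this toy example the k-FWER threshold with k = 7 is less strict than the Bonferroni threshold, so it retains one additional trend.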

All of the above measures of error control aim to guard against hypotheses being 
falsely declared as significant in the context of multiple tests. To be thorough, we 
have implemented all three types of error control at both global significance levels 
of 5% and 10%. The results of these analyses are summarized in Extended Data 
Table 2. We note that all three metrics heavily favour the null as they are designed 
to protect against the possibility of false positives. Despite this, the presence of local 
tests that reject the null represents a strong confirmation of the significance of 
those local tests. The fact that a number of local hypotheses still prevail as signifi- 
cant, even after imposing much stricter multiple testing error controls, arises partly 
from the fact that some of the local P values indicate trends that are so highly 
significant that they can withstand the stricter multiple testing error control
metrics. This rigorous multiple-testing error control lends increased credibility to the
scientific conclusions of robust trends in pattern occurrence.
Temperature extremes. Daily-scale hot and cold extreme occurrences are
calculated using temperature anomalies at each grid cell. Temperature anomalies are
computed by removing the seasonal cycle from daily reanalysis 2-m maximum/
minimum temperatures. Similar to previous studies, temperature extremes are
calculated based on the statistical distribution of daily temperature anomalies.
Hot/cold extreme thresholds are defined as the 95th/5th percentile value of the
1979-2013 daily 2-m maximum/minimum temperature anomaly distribution (for
example, for JJA, the population of daily-maximum temperature anomalies from
the months of June, July and August in the years 1979-2013). Hot/cold extreme
occurrences are defined as days on which the daily temperature anomalies are
greater/less than (or equal to) the hot/cold extreme thresholds. Reanalysis
temperature extremes are qualitatively similar to those found in station-based
observations. Given this similarity, we use the reanalysis temperatures in order to
maintain internal physical consistency between daily 2-m temperatures and daily
atmospheric circulation (as represented by the 500 hPa SOM circulation patterns).
Trends in temperature extreme occurrence are computed across satellite-era and
ice-era annual time series following the methodology of ref. 37.
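The threshold definition above can be sketched for a single grid cell and season as follows. The sketch assumes the seasonal cycle is the calendar-day mean across years and uses a linearly interpolated percentile; the function names and the synthetic temperature data are hypothetical.

```python
import math
import random


def percentile(values, q):
    """q-th percentile (0-100), linearly interpolated between order statistics."""
    s = sorted(values)
    pos = (len(s) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (s[hi] - s[lo]) * (pos - lo)


def hot_extreme_counts(tmax_by_year):
    """Hot-extreme days per year at one grid cell for one season.

    tmax_by_year : list of years, each a list of daily maximum temperatures
                   for the same calendar days of the season.
    """
    n_years = len(tmax_by_year)
    n_days = len(tmax_by_year[0])
    # Seasonal cycle: calendar-day mean across all years.
    cycle = [sum(year[d] for year in tmax_by_year) / n_years
             for d in range(n_days)]
    anomalies = [[year[d] - cycle[d] for d in range(n_days)]
                 for year in tmax_by_year]
    # Threshold from the pooled anomaly distribution of all years.
    pooled = [a for year in anomalies for a in year]
    threshold = percentile(pooled, 95)
    return [sum(1 for a in year if a >= threshold) for year in anomalies]


# Synthetic data: 35 seasons of 90 days with a weak warming trend.
rng = random.Random(0)
tmax = [[20.0 + 5.0 * math.sin(math.pi * d / 90) + 0.03 * y + rng.gauss(0.0, 2.0)
         for d in range(90)] for y in range(35)]
counts = hot_extreme_counts(tmax)
print(sum(counts), "hot-extreme days in", len(counts), "seasons")
```

Cold extremes follow the same pattern with the 5th percentile of daily minimum temperature anomalies and a less-than-or-equal comparison.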

Quantitative partitioning. To determine the dynamic and thermodynamic
contributions to trends in temperature extreme occurrence, we adapt the climate
change partitioning methodology of Cassano et al. Our adapted methodology
partitions the contributions of dynamic and thermodynamic changes to (1) the
overall trend in temperature extreme occurrence and (2) the trends associated with
individual SOM circulation patterns. Previous applications of the Cassano et al.
methodology indicate that partitioning is largely insensitive to the number of SOM
nodes used in the analysis. All trends in temperature extremes in the below
methodology are area-weighted averages. Following Cassano et al.:


$$E = \sum_{i=1}^{K} E_i f_i$$


where $E$ is the frequency of extreme temperature occurrence, $f_i$ is the frequency of
occurrence of SOM pattern $i$, $E_i$ is the frequency of extreme temperature occurrence
when SOM pattern $i$ occurs, and $K$ is the total number of SOM nodes. We
decompose $E_i$ and $f_i$ into time mean and deviation from time mean components:


$$E = \sum_{i=1}^{K} \left(\bar{E}_i + E_i'\right)\left(\bar{f}_i + f_i'\right)$$


Now we differentiate the above equation with respect to time, noting that the mean 
values are constants: 


$$\frac{dE}{dt} = \sum_{i=1}^{K} \left( \bar{f}_i \frac{dE_i'}{dt} + \bar{E}_i \frac{df_i'}{dt} + \frac{d}{dt}\left(E_i' f_i'\right) \right)$$


The derivative on the left-hand side provides the area-weighted average trend in 
the seasonal occurrence of temperature extremes for all days. The summation on 
the right-hand side, from left to right, provides the thermodynamic, dynamic and 
interaction contributions for days associated with each SOM pattern, i. 

The thermodynamic contribution of each circulation pattern’s extreme tem- 
perature trend assumes that each SOM pattern is stationary in time, and that 
trends in extremes that result during this pattern are the result of influences 
unrelated to circulation, such as changes in long-wave radiation from increasing 
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greenhouse gas concentrations, or changes in surface fluxes of moisture and/or 
radiation resulting from changes in land cover. The thermodynamic contribution 
associated with each circulation pattern is determined by taking the product of the 
trend in the intensity of temperature extremes and the mean occurrence of the 
circulation pattern. Trends in the intensity of temperature extremes are computed 
by calculating the trend in area-weighted extreme occurrence per pattern occur- 
rence (for example, Fig. 3n-q). 

The dynamic contribution of each circulation pattern’s extreme temperature 
trend assumes that, on average, a circulation pattern is associated with a portion of 
the total extreme event trend, and that changes in the occurrence frequency of that 
circulation pattern will modify the occurrence frequency of extreme events. The 
dynamic contribution associated with each circulation pattern is determined by 
taking the product of the trend in circulation pattern occurrences and the mean 
number of extreme events per pattern occurrence. 

The third component represents the interaction between dynamic and thermodynamic
changes, and captures contributions that result from changes in the
dynamic component acting on changes in the thermodynamic component, such as 
the positive/negative feedbacks of surface-atmosphere interactions. The interact- 
ive term is determined by computing the trend in the product of circulation 
pattern occurrence deviations and intensity of temperature extreme deviations. 
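
The three contributions described above can be sketched numerically. The example below is a minimal illustration, not the authors' analysis code: it assumes annual series of pattern frequency and extremes-per-pattern-occurrence for each SOM node, uses an ordinary least-squares slope as the trend estimator, and all function and variable names are hypothetical.

```python
import numpy as np

def linear_trend(series):
    """Ordinary least-squares slope of an annual series (units per year)."""
    years = np.arange(len(series))
    return np.polyfit(years, series, 1)[0]

def partition_trends(E_i, f_i):
    """Partition the trend in extreme occurrence by SOM pattern.

    E_i: (n_years, K) extremes per pattern occurrence for each pattern i
    f_i: (n_years, K) occurrence frequency of each pattern i
    Returns per-pattern thermodynamic, dynamic and interaction terms,
    plus the total trend in E = sum_i E_i * f_i.
    """
    E_i = np.asarray(E_i, dtype=float)
    f_i = np.asarray(f_i, dtype=float)
    E_bar, f_bar = E_i.mean(axis=0), f_i.mean(axis=0)   # time means
    E_dev, f_dev = E_i - E_bar, f_i - f_bar             # deviations
    K = E_i.shape[1]
    # Thermodynamic: mean pattern occurrence x trend in intensity
    thermo = np.array([f_bar[k] * linear_trend(E_dev[:, k]) for k in range(K)])
    # Dynamic: mean extremes per occurrence x trend in pattern occurrence
    dynamic = np.array([E_bar[k] * linear_trend(f_dev[:, k]) for k in range(K)])
    # Interaction: trend in the product of the two deviations
    inter = np.array([linear_trend(E_dev[:, k] * f_dev[:, k]) for k in range(K)])
    total = linear_trend((E_i * f_i).sum(axis=1))
    return thermo, dynamic, inter, total

# Synthetic example: 35 years, 4 SOM nodes (values are made up)
rng = np.random.default_rng(0)
E_i = rng.random((35, 4)) * 0.1
f_i = rng.random((35, 4)) * 30.0
thermo, dynamic, inter, total = partition_trends(E_i, f_i)
```

Because the least-squares slope is a linear operator and the time means are constants, the three terms summed over all patterns reproduce the total trend to within floating-point error, which mirrors the differentiated equation above.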

Code availability. SOM code is available at http://www.cis.hut.fi/projects/somtoolbox/.
All other analysis code is available upon request from the corresponding
author (danethan@stanford.edu).

Reanalysis data sets. NCAR/NCEP-Reanalysis 1 data was downloaded from
http://www.esrl.noaa.gov/psd/; NCEP-DOE-Reanalysis 2 data was downloaded
from http://www.esrl.noaa.gov/psd/; ECMWF ERA-Interim data was downloaded
from http://www.ecmwf.int/.

Extended Data Figure 1 | 2-, 4- and 8-node SOM analyses. SOM-derived
mid-atmospheric summer (JJA) circulation patterns (500 hPa geopotential
height anomalies) over Europe using 2- (a), 4- (b) and 8-node (c) analyses.
White boxed values show pattern frequencies in the top left and SOM node
numbers in the top right. Time series of SOM circulation pattern occurrence
(black (d yr⁻¹)), persistence (blue (d event⁻¹)) and maximum duration (red
(d event⁻¹)). The slope of the trend line (yr⁻¹) and P values (in parentheses) are
colour coded, with the values from 1979 to 2013 (solid trend line) displayed
above those from 1990 to 2013 (dashed trend line). Statistically significant
trends (5% significance level; Methods) are shown by bold fonts in the scatter
plots. Geopotential height fields are sourced from the NCEP-DOE-R2 reanalysis.

Extended Data Figure 2 | 16-node SOM analysis. SOM-derived mid-atmospheric
summer (JJA) circulation patterns (500 hPa geopotential height
anomalies) over Europe derived from a 16-node analysis. White boxed values
show pattern frequencies in the top left and SOM node numbers in the top
right. Time series of SOM circulation pattern occurrence (black (d yr⁻¹)),
persistence (blue (d event⁻¹)) and maximum duration (red (d event⁻¹)).
The slope of the trend line (yr⁻¹) and P values (in parentheses) are colour
coded, with the values from 1979 to 2013 (solid trend line) displayed above
those from 1990 to 2013 (dashed trend line). Statistically significant trends
(5% significance level; Methods) are shown by bold fonts in the scatter plots.
Geopotential height fields are sourced from the NCEP-DOE-R2 reanalysis.

Colour scale for panels a-d: 500 hPa geopotential height trends (1979-2013) (m yr⁻¹), from -2 to 2.

e, 1979-2013 trends in area-weighted geopotential heights (m yr⁻¹)

Region | DJF | MAM | JJA | SON
N.Hemisphere | 0.347 | 0.304 | 0.232 | 0.293
Euro | -0.210 | 0.708 | 0.221 | 0.053
wAsia | 0.570 | 0.361 | 0.656 | 0.585
cAsia | -0.271 | 0.491 | 0.130 | 0.151
eAsia | 0.005 | 0.100 | 0.439 | 0.551
wNAm | 0.370 | 0.314 | 0.438 | 0.386
cNAm | 0.188 | 0.409 | 0.207 | 0.550
eNAm | 0.370 | 0.319 | 0.527 | 0.818

f, Euro-JJA-1979-2013 (four SOM patterns, left to right; OCC, PER and DUR for each)
Removed | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR
None | -0.161 | -0.009 | 0.004 | 0.45 | 0.054 | 0.192 | -0.328 | -0.039 | -0.074 | 0.039 | -0.01 | -0.031
N.Hemisphere | -0.196 | -0.013 | -0.002 | 0.42 | 0.05 | 0.173 | -0.33 | -0.037 | -0.067 | 0.105 | -0.004 | -0.005

g, wAsia-JJA-1979-2013 (four SOM patterns, left to right; OCC, PER and DUR for each)
Removed | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR
None | 0.546 | 0.061 | 0.21 | -0.366 | -0.04 | -0.135 | 0.057 | 0.005 | -0.02 | -0.237 | 0.031 | 0.035
N.Hemisphere | 0.352 | 0.034 | 0.127 | -0.301 | -0.03 | -0.126 | 0.114 | 0.004 | 0 | -0.165 | 0.036 | 0.046

h, eNAm-SON-1979-2013 (four SOM patterns, left to right; OCC, PER and DUR for each)
Removed | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR
None | 0.438 | 0.019 | 0.086 | -0.285 | -0.011 | -0.063 | 0.097 | 0.013 | -0.037 | -0.25 | -0.007 | 0.003
N.Hemisphere | 0.366 | 0.017 | 0.075 | -0.266 | -0.005 | -0.055 | 0.091 | 0.01 | -0.041 | -0.19 | -0.002 | 0.003

i, eAsia-SON-1979-2013 (four SOM patterns, left to right; OCC, PER and DUR for each)
Removed | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR
None | 0.114 | -0.01 | 0.015 | 0.366 | 0.025 | 0.091 | -0.374 | 0 | -0.038 | 0.122 | 0.02 | 0.049
N.Hemisphere | -0.101 | -0.003 | 0.015 | 0.217 | 0.01 | 0.036 | -0.229 | 0.034 | -0.017 | 0.113 | 0.023 | 0.049

j, cAsia-DJF-1990-2013 (four SOM patterns, left to right; OCC, PER and DUR for each)
Removed | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR | OCC | PER | DUR
None | -0.127 | 0.03 | 0.127 | 1.044 | 0.119 | 0.262 | -0.621 | 0.013 | -0.07 | -0.296 | -0.005 | -0.042
N.Hemisphere | -0.169 | 0.037 | 0.103 | 1.105 | 0.144 | 0.319 | -0.701 | 0.021 | -0.056 | -0.235 | 0.004 | 0.006

Extended Data Figure 3 | Geopotential height trends and thermal dilation
adjustment. a-d, Northern Hemisphere polar projections of 1979-2013
seasonal trends in 500 hPa geopotential heights (same as Fig. 1, reproduced
here for convenience). e, Area-weighted trends in seasonal geopotential
heights over the Northern Hemisphere and regional SOM domains.
f-j, Trends in raw and detrended geopotential height SOM pattern
occurrence (OCC), persistence (PER) and maximum duration (DUR) in
units of d yr⁻¹ yr⁻¹ for domains and seasons highlighted in the main text.
The magnitudes of the (removed) seasonal Northern Hemisphere trends
can be found in e. Grid cells highlighted in grey contain trends significant
at the 5% level (Methods). SOM circulation patterns are abbreviated as
follows: A, anticyclonic; C, cyclonic; and combinations of the two represent
dipole patterns and west-east configurations. Geopotential height fields are
sourced from the NCEP-DOE-R2 reanalysis.

Extended Data Figure 4 | 1979-2013 (satellite era) robust atmospheric
circulation pattern trends. Time series of circulation pattern occurrence
(black (d yr⁻¹)), persistence (blue (d event⁻¹)) and maximum duration (red
(d event⁻¹)) from the NCEP-DOE-R2 reanalysis: a, summer over Europe;
b, summer over western Asia; c, summer over eastern North America;
d, autumn over eastern Asia; e, autumn over western Asia; f, autumn over
central North America; g, autumn over eastern North America; and h, spring
over Europe. Statistically significant trends ((yr⁻¹); 5% significance level;
Methods) are identified by bold font in the scatter plots.

Extended Data Figure 5 | 1990-2013 (ice era) robust atmospheric
circulation pattern trends. Time series of circulation pattern occurrence
(black (d yr⁻¹)), persistence (blue (d event⁻¹)) and maximum duration (red
(d event⁻¹)) from the NCEP-DOE-R2 reanalysis: a, winter over western Asia;
b, winter over central Asia; c, summer over western Asia; d, summer over
eastern North America; e, autumn over western Asia; and f, autumn over
eastern North America. Statistically significant trends ((yr⁻¹); 5% significance
level; Methods) are identified by bold font in the scatter plots.

Extended Data Figure 6 | 1979-2013 (satellite era) Northern Hemisphere
extreme temperature occurrence trends. Satellite-era extreme temperature
trends (d yr⁻¹ yr⁻¹) for winter cold (a) and hot (b) occurrences; spring cold
(c) and hot (d) occurrences; summer cold (e) and hot (f) occurrences; and
autumn cold (g) and hot (h) occurrences. Trends are calculated from the
NCEP-DOE-R2 reanalysis 2-m daily maximum/minimum temperatures.
Grid cells with statistically significant trends (5% significance level; Methods)
are stippled.

Extended Data Figure 7 | 1990-2013 (ice era) Northern Hemisphere
extreme temperature occurrence trends. Ice-era extreme temperature trends
(d yr⁻¹ yr⁻¹) for winter cold (a) and hot (b) occurrences; spring cold (c) and
hot (d) occurrences; summer cold (e) and hot (f) occurrences; and autumn cold
(g) and hot (h) occurrences. Trends are calculated from the NCEP-DOE-R2
reanalysis 2-m daily maximum/minimum temperatures. Grid cells with
statistically significant trends (5% significance level; Methods) are stippled.

Extended Data Figure 8 | Sensitivity of pattern similarity to number of
SOM nodes. To determine an adequate number of SOM nodes, we follow a
modified version of the methodology introduced by ref. 12, wherein the mean
pattern correlation of all daily geopotential height anomaly fields and their
matching SOM node patterns is computed for a suite of different SOM node
counts (3, 4, 5, 6, 7 and 8), for all regions and all seasons (black dots). We also
compute the maximum/minimum pattern correlation of daily geopotential
height anomaly fields with their matching SOM node pattern (red dots) and the
maximum/minimum SOM-pattern-to-SOM-pattern correlation (blue
triangles). The goal is to select an adequate number of nodes such that: (1) the
mean pattern correlation of all daily geopotential height anomaly fields is
relatively large; (2) the minimum pattern correlation of daily geopotential
height anomaly fields is relatively large; and (3) the maximum SOM-pattern-
to-SOM-pattern correlation is relatively small. Similar to ref. 12, we find that
four SOM nodes are generally sufficient to capture the different modes of
atmospheric variability, while being few enough that SOM patterns depict
distinct circulations. Geopotential height anomaly fields are sourced from the
NCEP-DOE-R2 reanalysis.

Extended Data Table 1 | Significant reanalysis circulation pattern trends and summary of multiple hypothesis testing

a, Statistically significant trends (P < 0.05)

Reanalysis | n | satellite-era occ | satellite-era per | satellite-era dur | ice-era occ | ice-era per | ice-era dur
Linear Regression Analysis
NCAR/NCEP-R1 | 112 | 17 | 6 | 7 | 15 | 10 | 9
NCEP-DOE-R2 | 112 | 16 | 5 | 7 | 13 | 8 | 9
ERA-Interim | 112 | 16 | 6 | 10 | 14 | 9 | 9
Box-Cox Transformation Analysis
NCAR/NCEP-R1 | 112 | 15 | 9 | 8 | 16 | 9 | 10
NCEP-DOE-R2 | 112 | 15 | 6 | 7 | 16 | 10 | 10
ERA-Interim | 112 | 16 | 8 | 12 | 14 | 9 | I

b, Anderson-Darling Test for Normality

Reanalysis | n | satellite-era occ | satellite-era per | satellite-era dur | ice-era occ | ice-era per | ice-era dur
Original Data
NCAR/NCEP-R1 | 112 | 103 | 47 | 56 | 102 | 76 | 68
NCEP-DOE-R2 | 112 | 102 | 57 | 58 | 102 | 74 | 70
ERA-Interim | 112 | 99 | 62 | 49 | 105 | 76 | 65
Original Data with FWER Multiple Hypothesis Testing
NCAR/NCEP-R1 | 112 | 112 | 102 | 102 | 112 | 103 | 107
NCEP-DOE-R2 | 112 | 112 | 100 | 103 | 112 | 107 | 109
ERA-Interim | 112 | 112 | 100 | 102 | 112 | 104 | 106
Box-Cox Transformed Data
NCAR/NCEP-R1 | 112 | 110 | 109 | 110 | 109 | 110 | 112
NCEP-DOE-R2 | 112 | 111 | 110 | 110 | 110 | 111 | 112
ERA-Interim | 112 | 110 | II | 109 | 111 | 109 | 108
Box-Cox Transformed Data with FWER Multiple Hypothesis Testing
NCAR/NCEP-R1 | 112 | 112 | 112 | 112 | 112 | 112 | 112
NCEP-DOE-R2 | 112 | 112 | 112 | 112 | 112 | 112 | 112
ERA-Interim | 112 | 112 | 112 | 112 | 112 | 112 | 112

c, Multiple Hypothesis Testing

Control type | Level | satellite-era occ | satellite-era per | satellite-era dur | ice-era occ | ice-era per | ice-era dur
NCAR/NCEP-R1
FWER | 5% | 2 | 0 | 0 | 0 | 1 | 0
FWER | 10% | 2 | 0 | 1 | 1 | 1 | 0
k-FWER | 5% | 4 | 1 | 2 | 1 | 2 | 0
k-FWER | 10% | 7 | 2 | 3 | 4 | 2 | 2
FDR | 5% | 0 | 0 | 0 | 0 | 0 | 0
FDR | 10% | 2 | 0 | 0 | 0 | 0 | 0
NCEP-DOE-R2
FWER | 5% | 2 | 0 | 0 | 0 | 1 | 0
FWER | 10% | 2 | 0 | 1 | 0 | 1 | 0
k-FWER | 5% | 2 | 1 | 2 | 2 | 2 | 0
k-FWER | 10% | 5 | 2 | 4 | 5 | 3 | 0
FDR | 5% | 0 | 0 | 0 | 0 | 0 | 0
FDR | 10% | 2 | 0 | 0 | 0 | 0 | 0
ERA-Interim
FWER | 5% | 0 | 0 | 1 | 0 | 0 | 0
FWER | 10% | 1 | 0 | 1 | 0 | 0 | 0
k-FWER | 5% | 3 | 2 | 2 | 2 | 0 | 1
k-FWER | 10% | 7 | D | 2 | 5 | 3 | 3
FDR | 5% | 0 | 0 | 0 | 0 | 0 | 0
FDR | 10% | 0 | 0 | 0 | 0 | 0 | 0

a, We indicate the number of linear regression and Box-Cox transformed statistically significant occurrence (occ), persistence (per) and maximum duration (max dur) trends that surpass the 95% confidence
threshold (5% significance level; Methods) for each reanalysis in each analysis period (1979-2013 is ‘satellite-era’ and 1990-2013 is ‘ice-era’). For a given metric (occ, per and max dur) in a given period (satellite-era
and ice-era), there are four patterns per region, over four seasons, for 7 domains. There are therefore 112 total trends in each metric in each period. b, Results of Anderson-Darling tests of normality indicating
the number of original and Box-Cox transformed distributions of residuals that do not reject the null hypothesis of Gaussianity at the 5% level. For both the original and Box-Cox Anderson-Darling tests, we also
apply multiple hypothesis testing to control the familywise error rate (FWER) at the 5% significance level. c, To protect against the possibility of false positives in the linear regression trend analyses, for each
reanalysis, we use three different multiple hypothesis error control methodologies to assess the number of locally significant hypotheses that surpass the global 5% or 10% significance level under strict error
control requirements. FWER, k-familywise error rate (k-FWER), and false discovery rate (FDR) analyses are applied (Methods). We note that all three metrics heavily favour the null. The presence of local tests
that reject the null therefore represents a strong confirmation of the significance of those local tests, as those local tests are so highly significant that they can withstand the stricter multiple testing error control
criteria.

Extended Data Table 2 | Quantitative partitioning of temperature extreme trends for select SOM analyses


trend patocc occtrend perpatocc] trend trend trend trend j oftrend dynamic thermo inter 
-0.16 24.94 0.001 -0.01 0.02 0.00 0.01 10.1 -112.0 237.9 -25.9 
0.45 23.66 0.002 0.03 0.04 0.00 0.06 61.5 44.3 57.3 -1.6 
-0.33 23.09 0.001 -0.01 0.02 0.00 0.01 8.7 -109.3 200.9 8.4 
0.04 20.31 0.001 0.00 0.02 0.00 0.02 19.7 5.9 96.6 -2.5 


Overall: 0.01 0.10 0.00 0.10 


Patoce Mean T95perpat Mean T95 Percent Percent Percent Percent 
trend patocc occtrend  perpatocc]| trend trend trend trend j oftrend dynamic thermo inter 
0.55 25.94 0.002 0.04 0.04 0.03 0.11 36.5 37.8 25.7 
-0.37 25.40 0.001 -0.02 0.04 0.00 0.02 -92.3 198.0 -5.7 
0.06 22.69 0.001 0.00 0.02 -0.01 0.02 9.9 122.3 -32.1 
-0.24 17.97 0.001 -0.01 0.02 -0.01 0.01 -72.2 240.8 -68.6 


Overall: 0.02 0.12 0.01 0.16 


trend patocc occtrend  perpatocc] trend trend trend trend j oftrend dynamic thermo inter 

0.30 22.31 0.02 0.04 0.01 0.07 51.4 29.9 56.8 13.3 

-0.02 21.63 0.00 0.02 0.00 0.02 13.1 -5.1 114.4 -9.3 

0.05 26.51 0.00 0.04 0.00 0.03 26.7 6.8 106.7 -13.6 

-0.33 21.54 -0.01 0.03 0.00 0.01 8.7 -105.6 234.8 -29.1 
Overall: 0.01 0.12 0.00 0.13 


trend patocc occtrend  perpatocc] trend trend trend trend j oftrend dynamic thermo inter 
21.04 0.001 0.00 0.01 0.00 0.01 9.0 -79.0 177.8 
24.61 0.001 0.08 0.02 0.01 0.10 149.6 74.6 18.1 7.2 
22.87 0.000 -0.01 0.01 -0.01  -0.01 -21.3 98.0 -45.4 47.4 
21.74 -0.001 -0.02 -0.02 0.01 = -0.03 -37.2 59.7 68.2 -27.8 


Overall: 0.04 0.02 0.01 0.07 


trend patocc occtrend  perpatocc] trend trend trend trend j oftrend dynamic thermo inter 
0.002 0.04 0.03 0.00 0.07 S17 42.5 5.8 
0.001 -0.01 0.03 0.00 0.02 -45.3 156.9 -11.6 
0.003 0.01 0.07 0.01 0.08 6.6 86.3 7.1 


0.002 -0.01 0.03 0.00 0.02 -44.1 161.9 -17.8 
Overall: 0.03 0.16 0.00 0.19 


Patoce Mean TO05perpat Mean T05 Percent Percent Percent Percent 
trend patocc occtrend perpatocc] trend trend trend trend j oftrend dynamic thermo inter 
-0.11 19.43 -0.001 -0.01 -0.02 0.01 -0.02 18.4 40.2 152.7 -92.9 
0.37 22.29 -0.001 0.01 -0.01 0.00 0.00 0.8 -1637.5 1844.1 -106.5 
-0.37 22.69 -0.001 -0.03 -0.02 0.00 = -0.05 62.5 56.5 35.0 8.4 
0.12 26.60 -0.001 0.00 -0.02 0.00 = -0.02 18.3 -31.2 119.7 11.5 


Overall: -0.02  -0.07 0.01  -0.08 


Trends are partitioned for Europe (a), western Asia (b), and eastern North America (c) satellite-era summer hot extremes; central Asia ice-era winter cold extremes (d); eastern North America satellite-era autumn
hot extremes (e); and eastern Asia satellite-era autumn cold extremes (f). From the left column moving right: (1) SOM node number; (2) trend in pattern occurrence (‘pat-occ’ d yr⁻¹ yr⁻¹); (3) mean pattern
occurrence for the period (‘pat-occ’ d yr⁻¹); (4) trend in temperature extremes per pattern occurrence (intensity of extreme occurrence) (‘ext-occ’ d pat-occ d⁻¹ yr⁻¹ yr⁻¹); (5) mean temperature extreme per
pattern occurrence (‘ext-occ’ d pat-occ d⁻¹ yr⁻¹); (6) trend in the dynamic influences (‘ext-occ’ d yr⁻¹ yr⁻¹); (7) trend in the thermodynamic influences (‘ext-occ’ d yr⁻¹ yr⁻¹); (8) trend in the interaction of dynamic
and thermodynamic influences (‘ext-occ’ d yr⁻¹ yr⁻¹); (9) the total trend in extreme occurrence for each SOM pattern (‘ext-occ’ d yr⁻¹ yr⁻¹); (10) the per cent of the total trend in extreme occurrences that occur
during each pattern (pattern trend percentages sum to 100%, meaning contributions from individual SOM patterns can be either positive or negative; that is, trend percentages greater than 100% indicate that
other circulation patterns provide negative contributions); (11) the per cent of column 10 that is due to dynamic influences; (12) the per cent of column 10 that is due to thermodynamic influences; and (13) the per
cent of column 10 that is due to interactive influences (dynamic, thermodynamic and interactive influence percentages for individual SOM patterns sum to 100%, meaning contributions from individual influences
can be either positive or negative). The overall trends for the domain are presented below the individual SOM rows. All data are sourced from the NCEP-DOE-R2 reanalysis.

Sexual selection protects against extinction

Reproduction through sex carries substantial costs, mainly because
only half of sexual adults produce offspring’. It has been theorized 
that these costs could be countered if sex allows sexual selection to 
clear the universal fitness constraint of mutation load’*. Under 
sexual selection, competition between (usually) males and mate 
choice by (usually) females create important intraspecific filters 
for reproductive success, so that only a subset of males gains 
paternity. If reproductive success under sexual selection is dependent
on individual condition, which is contingent on mutation load,
then sexually selected filtering through ‘genic capture’ could offset
the costs of sex because it provides genetic benefits to populations. 
Here we test this theory experimentally by comparing whether 
populations with histories of strong versus weak sexual selection 
purge mutation load and resist extinction differently. After evol- 
ving replicate populations of the flour beetle Tribolium castaneum 
for 6 to 7 years under conditions that differed solely in the 
strengths of sexual selection, we revealed mutation load using 
inbreeding. Lineages from populations that had previously experi- 
enced strong sexual selection were resilient to extinction and main- 
tained fitness under inbreeding, with some families continuing to 
survive after 20 generations of sib × sib mating. By contrast,
lineages derived from populations that experienced weak or non- 
existent sexual selection showed rapid fitness declines under 
inbreeding, and all were extinct after generation 10. Multiple 
mutations across the genome with individually small effects can 
be difficult to clear, yet sum to a significant fitness load; our find- 
ings reveal that sexual selection reduces this load, improving popu- 
lation viability in the face of genetic stress. 

Sexual selection is a widespread evolutionary force giving rise to a 
striking diversity of sights, sounds and smells that filter reproductive 
success away from less competitive or attractive individuals, frequently 
at the expense of survival’. Sexual selection will operate to varying 
degrees whenever sexual reproduction exists, and its significance as 
a potent force profoundly influencing reproductive fitness of indivi- 
duals is long established’. In contrast, limited empirical work has been 
directed at measuring the consequences of sexual selection for the 
fitness of populations. This lack of attention is surprising for two 
reasons: first, because population viability is vital for biodiversity 
maintenance and ecosystem stability, especially under modern anthro- 
pogenic stress”*; and second, because it is predicted that the mainten- 
ance of costly sex as the dominant mode of reproduction might only be 
possible if it allows sexual selection to operate, reducing the universal 
handicap of mutation load**. 

Population or lineage health will always suffer at some level from 
mutation load—the difference in fitness between a (usually theoretical) 
mutation-free lineage and one that carries a load of deleterious muta- 
tions that are segregating in mutation-selection balance’’. This load 
exists because new mutations continually arise in all populations every 
generation, most of which will be deleterious'®. Haldane calculated that 
mutation load would be unexpectedly high'', owing to large numbers 
of loci across each genome presenting multiple targets to unavoidable
mutation rates. Empirical estimates of mutation rate and load are hotly
debated’, but we know, for example, that the average human lineage 
carries hundreds of deleterious loss-of-function mutations’, possibly 
thousands’. Natural selection will quickly remove mutations with large 
effects, but load persists through accumulation of mutations that 
have small individual effects, and/or exist as recessive alleles where 
their deleterious phenotypes are less frequently exposed to selection. 
Mutation load can therefore sum to a significant fitness constraint for a
population which, because of its dispersed or concealed nature, is 
difficult to clear”. Sex could allow more effective purging of mutation 
load than asexual reproduction, if there are synergistic epistatic inter- 
actions between mutations’, so that their negative fitness impact is 
greater than strictly additive. However, the evidence that sufficient 
levels of synergistic epistasis exist remains equivocal’, so alternative 
explanations for the maintenance of costly sexual reproduction 
are sought. 

Sexual selection could be a key filter against mutation load if, as 
Darwin acknowledged, “sexual selection will have given its aid to 
ordinary selection”’*. This recognition that sexual selection places an 
additional, intraspecific filter on adaptive gene flow has been forma- 
lized by the idea of ‘genic capture”, which proposes that reproductive 
success in the face of competition and choice depends on most or all 
aspects of an individual’s condition. Thus, sexual selection will act on 
most loci across the genome, purifying deleterious alleles from indivi- 
duals within a lineage, and promoting fixation of advantageous ones, 
via three potentially connected routes. First, if competition or choice 
promotes non-random mating or fertilization success as an inverse 
consequence of mutation load, then deleterious dominant and recess- 
ive alleles will be under stronger purifying selection in lineages that 
experience sexual selection’. Second, if mutations are more deleterious 
in the competing sex than the choosing sex, which would occur if the 
competing sex suffers amplified reproductive success variance as a 
result of mutation load, then the equilibrium frequency of both dom- 
inant and recessive deleterious mutations will be lower within sexually 
selected populations, even accounting for synergistic epistasis in the 
competing sex’. Put simply, fathers should carry fewer mutations than 
all males*. Third, if female ability to maintain condition for reproduct- 
ive fitness is under selection from interlocus sexual conflict, then sex- 
ual selection could purge mutations through females'*’’. These routes 
for purging load via sexual selection provide theoretical explanations 
for how costly sex can persist as a dominant mode of reproduction~”. 

Few experiments have tested whether sexual selection removes deleterious
mutations, and those that have yielded inconsistent findings. After introduction of
mutations, heightened sexual selection rescued fitness faster in 
Rhizoglyphus robini mites'®, Drosophila melanogaster flies'’ and 
Onthophagus taurus beetles'*. However, further work showed that 
sexual selection reduced mutation loads only for a subset of fitness 
traits in Drosophila’’. More detailed studies found no evidence that 
sexual selection could purge deleterious alleles from experimentally 
evolving lineages”, or restore fitness after mutation load had been 
induced, even after 60 generations”’; in fact, reproductive fitness within 
mutated lines became higher when sexual selection had been minimized.
These inconsistent results could be explained by interlocus
sexual conflict’: while sexual selection might play a beneficial role in 
purging mutation load, direct short-term constraints on population 
productivity may also arise when female fitness is constrained by 
conflicting adaptations that promote only male reproductive poten- 
tial’. Thus, short-term fitness costs arising simultaneously from inter- 
locus sexual conflict could confound the measurement of longer-term 
fitness benefits arising from sexual selection’. To avoid this problem, 
we assayed mutation load after populations (from the same ancestry) 
had experienced almost 7 years of sole variation in the intensity of 
sexual selection (Extended Data Figs 1 and 2). Although no evidence 
for carry-over effects of interlocus sexual conflict exists in our model 
under these conditions"’, we also removed potential trans-generational 
effects by enforcing two generations of monogamous reproduction 
before beginning to assay fitness. To expose mutation load in each 
population, we enforced inbreeding using sib × sib pairings, tracking
fitness changes down multiple family lines (Extended Data Fig. 3). 
Because mutations will more likely persist as recessive alleles that are 
less frequently exposed to selection, compared with dominant wild 
types, fitness depression is a normal consequence of inbreeding as 
homozygosity increases the expression frequency of deleterious reces- 
sives through partial dominance”. By tracking extinction rates and 
fitness declines down inbreeding lineages perpetuated by monogam- 
ous sibling pairings, we tested the hypothesis that sex allows sexual 
selection to generate significant benefits through the purging of muta- 
tion load”, while avoiding the concurrent confound of interlocus 
sexual conflict”. 

Experimental evolution lines began in 2005, with two different 
regimes (A and B) that both exposed replicate lineages to either ‘strong’ 
or ‘weak’ treatments of sexual selection, providing parallel independ- 
ent experiments to measure whether sexual selection purges mutation 
load. Regime A exposed populations to 54 generations of divergent 
adult operational sex ratios that were either male-biased (10 females to 
90 males) versus female-biased (90 females to 10 males), while regime 
B allowed polyandry (1 female to 5 males) versus enforced monogamy 
(1 female to 1 male) for 45 generations. All other conditions among 
lines and treatments within a regime were kept identical, including 
equalizing theoretical effective population sizes (see Methods, 
Extended Data Figs 2 and 4 and Supplementary Information for details 
confirming that heterozygosity was identical between treatments 
within either regime). The only difference between ‘strong’ and ‘weak’ 
treatments was therefore intensity of sexual selection during adult 
reproduction. 

Our findings clearly showed for both regimes that lineages derived 
from populations experiencing evolutionary histories of strong sexual 
selection resisted extinction and maintained fitness more effectively 
when mutation load was exposed (Figs 1 and 2 and Extended Data Figs
5–7; all means presented in this article are arithmetic means ± standard
errors). Over the 3-year extinction assay, families derived from
male-biased populations survived 44% longer than families from
female-biased populations (mean number of generations to extinction
= 9.24 (±1.29 s.e.m.) versus 6.46 (±0.15); Fig. 1a). Families from
polyandrous histories survived 37% longer on average than those from
monogamous treatments (8.50 (±1.02) generations versus 6.21
(±0.46); Fig. 1b). When we combined data from regimes A and B into
one analysis (incorporating sexual selection as a fixed factor variable),
the history of sexual selection remained a significant predictor of
number of generations to extinction (z = −3.43, P < 0.001), and
there was no difference in extinction rates between regimes A and B
(z = −0.51, P = 0.611), nor a significant interaction (z = 0.44, P =
0.660), revealing a consistent effect of sexual selection on extinction 
rates (Extended Data Fig. 5c). Overall, families derived from popula- 
tions evolving under histories of strong sexual selection survived for 
40% longer than those derived from weak sexual selection histories, 
giving an average survival time under mutation load exposure of 8.87
(±0.37) versus 6.33 (±0.13) generations respectively. All 108 initial
families derived from weak sexual selection histories ceased to produce
offspring beyond the 10th generation of inbreeding, whereas 8 of the
108 families from the strong sexual selection histories were still producing
offspring after 20 generations of inbreeding.

Figure 1 | Extinction trajectories under increasing inbreeding differ
between family lines derived from strong (red squares) versus weak (blue
circles) sexual selection histories. Each generation mean presents the average
(±s.e.m.) proportion of surviving families for three independent lines per
treatment. a, Regime A: male-biased (red) versus female-biased (blue) sexual
selection histories. Each line is represented by 28 initial families (n = 84 total
families for either treatment). b, Regime B: polyandrous (red) versus
monogamous (blue) selection histories. Each line is represented by eight initial
families (n = 24 total families for either treatment). Using parametric
accelerated failure-time survival models with sexual selection treatment as a
fixed effect, and incorporating correlated data within lines using a generalized
estimating equation approach, we identified significantly lower extinction rates
in populations that had previously experienced strong histories of sexual
selection (Extended Data Fig. 5a, regime A: z = 3.40, P < 0.001; Extended Data
Fig. 5b, regime B: z = 2.81, P = 0.005).

Declines in reproductive fitness exhibited similar patterns to the 
extinction rates they underpinned. Having removed sexual selection 
history via enforced monogamy in regime B, baseline reproductive 
fitness even without inbreeding is substantially reduced (Fig. 2b shows 
that a polyandrous history improves baseline fitness by ~30%). Most 
importantly, significant interactions between inbreeding and fitness 
decline revealed that fitness declines were much faster for families 
derived from treatments experiencing weak sexual selection histories 
in both regimes (z = 13.82 and 10.56 for A and B respectively, 
P < 0.001 for both; Fig. 2, Extended Data Table 1 and Extended 
Data Figs 6 and 7). To control for the possibility that failure-to-mate 
could have created differences between treatments, we repeated these 
analyses including only pairs that produced offspring: identical patterns
of significant interactions between sexual selection history,
inbreeding and fitness decline remained (z = 5.73 and 6.36 for A and B
respectively, P < 0.001 for both; Extended Data Figs 6c, d and 7c, d).

Figure 2 | Reproductive fitness declines under increasing inbreeding of
families derived from strong (red squares) versus weak (blue circles) sexual
selection histories differ in magnitude and rate. a, Regime A: male-biased
(red) versus female-biased (blue) selection histories. b, Regime B: polyandrous
(red) versus monogamous (blue) histories. Each generation mean presents the
average (±s.e.m.) number of offspring produced under standardized
conditions for three independent lines per treatment. Lines are represented by
eight initial families (n = 24 total families per treatment), and two breeding
pairs per family. Average fitness under identical conditions but without
inbreeding (‘Ref’) plotted for reference. Using generalized linear mixed models,
and accounting for overdispersion as well as nesting non-independent replicate
families within lines as random effects, we found in both regimes that
inbreeding was a highly significant predictor of fitness (regime A: z = −21.34,
P < 0.001; regime B: z = −21.17, P < 0.001), and that sexual selection
treatment history was also a significant predictor of overall differences in fitness
(regime A: z = −3.75, P < 0.001; regime B: z = −1.97, P = 0.048; see Extended
Data Table 1 and Extended Data Figs 6 and 7).
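
The size of the inbreeding × treatment interactions can be made more concrete with a rough reading of the fixed-effect estimates in Extended Data Table 1. The sketch below is illustrative only and is not the authors' analysis code. It assumes that the negative binomial GLMMs use a log link, that the weak sexual selection treatment is the reference level, and that the interaction estimate is added to the inbreeding slope for the strong treatment; the function and dictionary names are hypothetical.

```python
import math

# Fixed-effect estimates copied from Extended Data Table 1.
# Assumptions: estimates are on the log scale, the weak treatment is the
# reference level, and the interaction row is the change in slope for the
# strong-selection treatment named in that row.
ESTIMATES = {
    "A": {"inbreeding": -0.63, "interaction": 0.42},  # interaction: male-biased
    "B": {"inbreeding": -0.53, "interaction": 0.29},  # interaction: polyandry
}


def fitness_multiplier_per_generation(regime: str) -> dict:
    """Per-generation multiplicative change in expected offspring number."""
    est = ESTIMATES[regime]
    weak_slope = est["inbreeding"]
    strong_slope = est["inbreeding"] + est["interaction"]
    return {"weak": math.exp(weak_slope), "strong": math.exp(strong_slope)}


if __name__ == "__main__":
    for regime in ("A", "B"):
        m = fitness_multiplier_per_generation(regime)
        print(regime, round(m["weak"], 2), round(m["strong"], 2))
```

Under these assumptions, expected fitness in regime A falls to roughly 53% of its previous value with each generation of inbreeding in the weak treatment, compared with roughly 81% in the strong treatment; the corresponding values for regime B are roughly 59% and 79%.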

Results from this 10-year experiment provide compelling empirical
support for the complementary models given in refs 2 and 3, which
argued that costs of sex could be offset by population genetic benefits
derived from sexual selection. The most obvious mechanism explaining
these differences is that heightened sexual selection, via ‘genic
capture’, more effectively strips out, or prevents, mutation load from
becoming fixed in a population, strengthening its ability to withstand
stress. Hemizygous selection could make sex-linked loci especially
prone to this process, but since the T. castaneum X chromosome
makes up only ~6% of the genome, we expect purging of load to
have occurred on the autosomes too. There is emerging evidence that
sexual selection can profoundly shape the genome, with feminized
patterns of sex-linked expression in the transcriptomes of both sexes
after experimental evolution under monogamy in Drosophila. Our
findings indicate that sexual selection also acts to purge mutation load,
even in populations that we expect to be adapting close to their natural
fitness peaks. Stronger sexual selection will drive greater variance in
reproductive success, so that the average father should carry fewer
deleterious mutations than the average male, and perhaps also the
average mother if interlocus sexual conflict constrains female fitness.

472 | NATURE | VOL 522 | 25 JUNE 2015

Within the promiscuous Tribolium model (Extended Data Fig. 8
and Methods), both pre- and post-copulatory processes for winning or 
controlling fertilizations will operate through broad behaviours and 
physiology within the whole organism, down to competition and 
choice at the gamete level. Genic capture predicts that individual suc- 
cess across this continuum will depend on overall condition and genes 
at many loci. Under monogamy, where sexual selection has been
removed, populations suffer the constraints of sexual reproduction, 
but none of the benefits of sexual selection, explaining the lowered 
baseline fitness even without inbreeding, as well as more rapid fitness 
declines and heightened extinction rates under inbreeding (Figs 1 and 
2 and Extended Data Figs 5-7). Although we exposed mutation load 
for experimental measurement through inbreeding, the intense and 
diverse demands from selection across multiple generations in the 
natural environment, where populations are more likely to be displaced
from their fitness peaks, are likely to expose even greater
fitness differentials due to load variance. After only one generation 
of inbreeding, for example, mean fitness between strong versus weak 
sexual selection histories differs by 20% and 40% in regimes A and B 
respectively (Fig. 2). Our results indicate, as demands rise upon populations
that have been depleted and fragmented, and displaced further
from their fitness peaks via increasing anthropogenic stress, that
sexual selection could be an important force protecting species or
populations from the extinction vortex.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS


Experimental evolution lines. Beetles were of the widely used Georgia-1 (GA1) 
‘wild type’ strain, originally collected from stored corn in 1980, and since cultured 
by the Beeman laboratory (United States Department of Agriculture), maintained 
under standard conditions. Adult virgin beetles from the same ancestral GA1 
population were randomly allocated to begin their respective treatments within 
both regimes. Each generation, male and female pupae were separated and placed 
in fresh fodder (organic flour, yeast (10:1) and oats) for 10 days to allow adult 
emergence and sexual maturation. Then, using the controlled sex ratios in
the different sexual selection treatments, mature adults were placed in ad libitum
fodder for 7 days to compete, choose, mate, oviposit and therefore reproduce
under divergent intensities of sexual selection. After 7 days, adults were
removed and the eggs and larvae (typically about 70 offspring per female) were 
left to develop under standardized conditions with equal offspring densities by 
maintaining ad libitum levels of food in proportion to the number of offspring. 
These conditions were maintained for about 7 years, allowing the application of 
strong versus weak sexual selection in a total of 12 independent lines through sole 
variation in the adult operational sex ratio, while equalizing effective population 
size within regimes. Two parallel experimental evolution regimes were run, 
regimes A and B, which both applied treatments that created contrasting diver- 
gences in sexual selection intensities (see Extended Data Fig. 2 for visual details). 
Under regime A, the male-biased treatment provided 10 females with choice 
among 90 males, and simultaneous competition between the 90 males to fertilize 
the 10 females, whereas in female-biased treatments the reverse scenarios applied. 
Six independent lines (three per treatment) were maintained for 54 generations 
under regime A. Regime B generated divergence by enforcing either monogamy 
(20 replicate pairs per line), or allowing reproduction to be achieved through 
competition between five males under polyandry (12 replicate groups per line). 
Thus, polyandry provided each female with a choice among five males, and com- 
petition between five males for just one female, whereas monogamy completely 
removed all female choice and male-male competition. Six independent lines 
(three per treatment) were maintained for 45 generations under regime B. After 
each adult reproduction period and sexual selection treatment, offspring from 
replicate families within each independent line were pooled under standardized 
densities for larval development and genetic mixing to the next generation. 
Quantifying sexual selection intensity. Divergence in the strength of sexual 
selection between the different treatments was estimated using a male mating 
potential assay, in which males from the ancestral GA1 control stock population 
were each provided with a series of 84 unmated females. Individuals were sexed as
pupae and matings took place after 10 days of adult maturation (as per the sexual
selection treatments). Males were placed individually in microcentrifuge tubes
with approximately 1 ml of flour topped with oats; 24 h before their trial they
were identified with a small dot of white correction fluid on their thorax.
Females were placed in groups of six per Petri dish with 7 g of flour topped with
oats. Mating assays were performed by placing individual males into Petri dishes
containing standard fodder and six females, moving males to a new group of six
females every 12 h for 7 days (12 × 7 = 84 females given to each male over the
week). After each 12 h mating period, females were placed singly into Petri dishes 
with standard fodder for 7 days to lay eggs. Any offspring were then allowed to 
develop for 35 days before being frozen, and their presence noted to score successful
matings. Eleven males were assayed, and the average number of successfully
fertilized females across this 1 week period per male was 50 (±3 s.e.m.;
Extended Data Fig. 8), providing extreme divergence in potential levels of female
polyandry (and therefore sexual selection) between strong and weak sexual selection
treatments. Thus, in regime A the 10 females at each adult generation in the
male-biased treatment were potentially exposed to 90 × 50 = 4,500 successful
matings, or 450 matings per female per week involving all 90 males, whereas the
90 females in the female-biased treatment potentially experienced 500 matings (10
× 50), or 5.6 per female per week, enabling each female to be mated, on average, by
5.6 of the males available. In the monogamous treatment, we removed sexual
selection altogether as only one male was available for mating, while the poly- 
androus treatment generated a fivefold increase in potential for pre- and post- 
copulatory male-male competition and female choice through the availability of 
five males, all of which could mate with the female. 
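
The mating-potential arithmetic above can be reproduced with a short sketch. It is illustrative only: the function names are hypothetical, and the only inputs are the figures stated in the text (six females offered every 12 h for 7 days, and a mean of 50 successfully fertilized females per male per week).

```python
MEAN_MATES_PER_MALE = 50  # mean females fertilized per male per week (mating assay)


def females_offered(group_size: int, hours_per_group: int, days: int) -> int:
    """Number of unmated females offered to one male over the assay."""
    groups_per_day = 24 // hours_per_group
    return group_size * groups_per_day * days


def potential_matings(n_females: int, n_males: int,
                      mates_per_male: int = MEAN_MATES_PER_MALE):
    """Return (total potential matings, potential matings per female) per week."""
    total = n_males * mates_per_male
    return total, total / n_females


if __name__ == "__main__":
    print(females_offered(6, 12, 7))            # females offered per male
    print(potential_matings(10, 90))            # regime A, male-biased
    print(potential_matings(90, 10))            # regime A, female-biased
```

The per-female figure for the female-biased treatment is 500/90, which rounds to the 5.6 quoted in the text.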

Effective population sizes. To avoid differential inbreeding that could have subsequently
influenced our extinction assay, we equalized the theoretical effective
population size, $N_e$, for mixed adult sex ratios ($N_e = 4N_mN_f/(N_m + N_f)$,
where $N_m$ and $N_f$ are the numbers of breeding males and females) (ref. 32) in our
divergent sexual selection treatments; thus, in regime A $N_e$ = 36 for both male-
and female-biased treatments, and in regime B $N_e$ = 40 for both monogamous and
polyandrous treatments. It is important to note that strong sexual selection treatments
with male-biased or polyandrous structures may translate into reduced
realized $N_e$ due to male success skew under strong sexual selection (where competition
and choice could allow reproductive success by a smaller subset of males
in the breeding population). Thus, our experimental design is conservative to any
influence of $N_e$, because we test the prediction that lineages with histories of strong
sexual selection (and therefore potentially lower $N_e$) should demonstrate resistance
to extinction and reduced rates of fitness decline under inbreeding (because
they have more effectively purged mutation load). If $N_e$ is lower in these strong
sexual selection populations, we would expect this to result in reduced heterozygosity
at the start of inbreeding, resulting in lower initial fitness and therefore
faster extinction rates (yet we observe the converse in Figs 1 and 2). As an additional,
third, check for $N_e$ confounds, we also directly measured heterozygosity and
allelic richness in our lines to establish that differential inbreeding had not
occurred under different sexual selection treatments (methods outlined below).
By screening 628 individuals representing the different sexual selection treatments 
in both regimes A and B at 13 microsatellite loci (Supplementary Information), we 
were able to confirm that heterozygosity and allelic richness showed no differences 
between ‘weak’ or ‘strong’ sexual selection histories (Extended Data Fig. 4). 
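
As a check on the stated values, the theoretical effective population size can be recomputed for each treatment. The sketch below assumes the standard formula for unequal numbers of breeding males and females, N_e = 4 N_m N_f / (N_m + N_f), and takes the per-line adult numbers from the treatment descriptions in the Methods; the function and dictionary names are hypothetical.

```python
def effective_population_size(n_males: int, n_females: int) -> float:
    """Theoretical N_e for unequal sex ratios: 4 * Nm * Nf / (Nm + Nf)."""
    return 4 * n_males * n_females / (n_males + n_females)


# (males, females) per line in each adult generation, from the Methods.
TREATMENTS = {
    "A male-biased": (90, 10),
    "A female-biased": (10, 90),
    "B monogamous": (20, 20),          # 20 pairs of 1 male and 1 female
    "B polyandrous": (12 * 5, 12),     # 12 groups of 5 males and 1 female
}

if __name__ == "__main__":
    for name, (males, females) in TREATMENTS.items():
        print(name, effective_population_size(males, females))
```

Both regime A treatments give 36 and both regime B treatments give 40, matching the values in the text.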
Exposing mutation load. Having been subjected to experimental evolution that 
applied strong versus weak sexual selection under equal effective population sizes, 
we then exposed the mutation load carried in these populations by inbreeding 
down multiple replicate family lines to expose deleterious recessives, and tracking 
fitness decline to extinction as a result of inbreeding depression through partial 
dominance. No statistical methods were used to predetermine sample size, which
was fundamentally controlled by the number of independent selection lines that 
had been maintained for 6 to 7 years. At the end of experimental evolution, we 
created multiple full-sib families using monogamous crosses of randomly chosen 
unmated males and females within each independent selection line, which simul- 
taneously removed any transgenerational interlocus sexual conflict effects, and 
from which full-sib offspring were then used to continue a total of 216 family lines 
down increasing inbreeding coefficients and homozygosity (Extended Data Fig. 3). 
There were thus two generations of monogamy applied before fitness assays began, 
eliminating any carry-over effects of sexual selection or interlocus conflict. As 
levels of inbreeding increased, we tracked extinction rates and fitness declines 
for multiple families within each of the 12 independent lines from the original 
sexual selection treatments. In regime A, we created 28 full-sib families in each
independent line ($n_{\mathrm{total}}$ = 28 families × 3 independent lines × 2 sexual selection treatments =
168 families), allowing extinction rates to be measured within each of the three
independent lines in either of the two sexual selection treatments. When sibling 
pairs failed to produce female and male offspring that survived to adulthood to 
enable continuation of the line, it was recorded as extinct (see Extended Data Fig. 
3). In addition to extinction, we also measured reproductive fitness in a subset of 
these families, assayed as the average number of offspring produced by two randomly
chosen sib × sib pairs within eight of the families per line (Extended Data
Fig. 3). Identical protocols for regime B were followed, except that both extinction
rates and average fitness were measured across eight families per independent line
($n_{\mathrm{total}}$ = 8 families × 3 independent lines × 2 sexual selection treatments = 48 families).
Blinding was not performed as the protocol did not permit biasing to affect results.
Inbreeding was continued for 20 generations of sib × sib matings (which operated
over 3 years), by which time Wright’s inbreeding coefficient F had increased to 
0.986. After 20 generations of inbreeding, 208 of the 216 initial lines had gone 
extinct, with the eight survivors all derived from strong sexual selection histories. 
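
The inbreeding coefficient quoted above can be recovered from the standard recurrence for continued full-sib mating, F_t = (1 + 2F_{t-1} + F_{t-2})/4. The sketch below is a minimal illustration that assumes the founding pair of each family was unrelated and non-inbred (F = 0); the function name is hypothetical.

```python
def full_sib_inbreeding(generations: int) -> list:
    """Wright's F after 1..generations rounds of sib x sib mating.

    Uses F_t = (1 + 2*F_{t-1} + F_{t-2}) / 4, starting from an unrelated,
    non-inbred founding pair (an assumption of this sketch).
    """
    f_two_back, f_one_back = 0.0, 0.0
    series = []
    for _ in range(generations):
        f_now = 0.25 * (1.0 + 2.0 * f_one_back + f_two_back)
        series.append(f_now)
        f_two_back, f_one_back = f_one_back, f_now
    return series


if __name__ == "__main__":
    f = full_sib_inbreeding(20)
    print(round(f[0], 3), round(f[9], 3), round(f[19], 3))
```

The first generation of sib × sib mating gives F = 0.25, and the twentieth gives F ≈ 0.986, in agreement with the value stated in the text.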
Baseline fitness. At four time-points throughout the extinction assay (parental, 
F,, Fs and F2, generations), we assayed ‘baseline’ fitness (without inbreeding) of 
standard monogamous crosses from the different sexual selection treatments, 
using identical protocols to fitness measures applied for the inbred crosses. 
Pupae from each of the three independent lines within each of the four different 
selection treatments were separated into single-sex groups and isolated for 10 days 
to allow adult eclosion and sexual maturation. Twenty randomly chosen male- 
female pairs were then established for each of the three independent lines within 
each of the four sexual selection treatments. Each pair was placed into a 7 ml plastic
vial, with ad libitum food and oats, and allowed to mate and oviposit for 7 days. 
After this, the flour from each vial, containing the eggs laid during the 7 day mating 
period, was transferred to a Petri dish containing a further 10 g of fodder, and the 
eggs allowed to develop to adult eclosion (~35 days). Fitness was scored as the 
total number of adults produced from each cross across 7 days of mating and 
oviposition. Data from previous work where male-female pairs were allowed to
interact for 7 days show (a) that the first week of oviposition produces >25% of
the total female reproductive fitness for such male-female interactions, and (b) 
that female fitness over the first week significantly correlates with total female 
reproductive fitness (r = 0.48, P = 0.008, n = 29 pairs). Average baseline fitness 
values are presented for reference in Fig. 2a, b (‘Ref’).

Extinction analyses. Extinction rates were analysed using the ‘survival’ package
in R version 3.1.0 (ref. 34). To assess whether sexual selection history influenced
extinction rate after 20 generations of sib × sib inbreeding for either regimes A or
B, we analysed generation to extinction using a parametric accelerated failure-time
survival model with Weibull baseline hazard distribution, taking into account
right-censored data from families that were still alive at the end of the experiment. 
Generation of extinction was modelled with sexual selection treatment as a fixed 
effect, and the shape and scale of the underlying Weibull hazard distribution was 
allowed to vary by treatment. We also ran a combined analysis where both experi- 
mental evolution regimes were incorporated into the model, with level of sexual 
selection (strong versus weak), comparison (regime A or B) and their interaction 
as fixed effects. In all survival analyses, correlated data within lines were incorpo- 
rated using a generalized estimating equation approach. Akaike’s information 
criteria and graphical interpretation of the complementary log-log survival plots 
were used to confirm appropriateness of the specified Weibull distribution; no 
violations of model assumptions were detected. Model fits are presented in 
Extended Data Fig. 5a-c. 
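
To illustrate how right-censored families enter a Weibull accelerated failure-time model, the sketch below writes out the log-likelihood by hand. It is not the analysis code used here (the analyses were run with the ‘survival’ package in R), it omits the generalized estimating equation step for correlated data within lines, and it uses a single shared shape parameter for simplicity. All function names and parameter values are hypothetical.

```python
import math


def aft_scale(intercept: float, effect: float, strong: bool) -> float:
    """Weibull scale under an AFT model: log(scale) is linear in treatment."""
    return math.exp(intercept + (effect if strong else 0.0))


def weibull_survival(t: float, shape: float, scale: float) -> float:
    """Probability that a family is still extant at generation t."""
    return math.exp(-((t / scale) ** shape))


def log_likelihood(times, extinct, shape: float, scale: float) -> float:
    """Weibull log-likelihood with right-censoring.

    Extinct families contribute the log density at their extinction time;
    families still alive at the end contribute only log S(t).
    """
    total = 0.0
    for t, went_extinct in zip(times, extinct):
        z = (t / scale) ** shape
        if went_extinct:
            total += math.log(shape / scale) + (shape - 1.0) * math.log(t / scale) - z
        else:
            total += -z
    return total


if __name__ == "__main__":
    # Made-up example: three extinct families and one censored at generation 20.
    times = [5.0, 7.0, 9.0, 20.0]
    extinct = [True, True, True, False]
    print(log_likelihood(times, extinct, shape=2.0,
                         scale=aft_scale(2.0, 0.3, strong=True)))
```

A positive treatment effect on log(scale) stretches survival times, which is how an AFT model expresses slower extinction in one treatment than in another.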

Reproductive fitness analyses. We tested for a relationship between fitness and 
generation of inbreeding by fitting generalized linear mixed models (GLMMs) 
using the glmmADMB package in R 3.0.3 (ref. 34). We assumed a negative
binomial error structure which added the parameter (k) to the variance–mean
relationship, allowing us to account for the overdispersion in our data introduced
by a large number of zero fitness observations. glmmADMB has two options for
fitting the relationship between the mean and the variance. These are family =
‘nbinom1’, which assumes the variance = k × mean, and family = ‘nbinom2’,
which assumes the variance = mean(1 + mean/k). We fitted models with each of
these options in turn and then selected the parameterization that provided the 
lowest AIC score and therefore accounted for the greatest amount of variance in 
our data set. To account for the nested nature of the experimental design and the 
non-independence between replicate families, random effects were included in the 
model as replicate nested within family nested within line. Separate GLMMs were 
fitted to the regime A (male-biased versus female-biased) comparison and the 
regime B (polyandry versus monogamy) comparison (Extended Data Figs 6 and 
7). To ensure that failure-to-mate was not a reason for differences in fitness declines 
between treatments, we repeated the analyses including only those families that 
produced some offspring each generation (Extended Data Figs 6c, d and 7c, d). 
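
The two variance–mean relationships described above, and the selection between them by AIC, can be sketched as follows. This is an illustration of the formulas only, not a reimplementation of glmmADMB; the function names are hypothetical and the log-likelihoods in the example are made-up numbers.

```python
def variance_nbinom1(mean: float, k: float) -> float:
    """'nbinom1' parameterization: variance = k * mean."""
    return k * mean


def variance_nbinom2(mean: float, k: float) -> float:
    """'nbinom2' parameterization: variance = mean * (1 + mean / k)."""
    return mean * (1.0 + mean / k)


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike's information criterion: 2 * parameters - 2 * log-likelihood."""
    return 2 * n_params - 2 * log_likelihood


def pick_by_aic(fits: dict) -> str:
    """Return the name of the fit with the lowest AIC.

    `fits` maps a model name to (log_likelihood, n_params).
    """
    return min(fits, key=lambda name: aic(*fits[name]))


if __name__ == "__main__":
    print(variance_nbinom1(10.0, 2.0), variance_nbinom2(10.0, 2.0))
    # Made-up log-likelihoods, for illustration only.
    print(pick_by_aic({"nbinom1": (-100.0, 6), "nbinom2": (-98.0, 6)}))
```

With the same mean and k, the second parameterization lets the variance grow quadratically with the mean, whereas the first keeps it proportional to the mean.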
Microsatellite analyses. A total of 628 individuals from the 12 independent lines 
representing all four sexual selection treatments (strong versus weak in regimes A 
and B) were genotyped at 13 loci conforming to Hardy-Weinberg equilibrium
(Supplementary Information). Scored genotypes for all individuals were analysed
in Arlequin version 3.5 (ref. 36) to obtain allelic richness and observed and
expected heterozygosity. We tested for differences in heterozygosity across selection
treatments using linear mixed models, implemented in the lme4 package version 1.1-6
(ref. 37) in R (ref. 34), with significance testing performed using the package
lmerTest version 2.0-6 (ref. 38). Heterozygosity was included as a response variable
with selection treatment as a fixed factor and locus as a random factor. To
assess whether independent line identity had a significant effect on genetic divers- 
ity beyond that of selection treatment, a second mixed model was tested with the 
same variables plus line identity as an additional random factor. The two models 
were then compared using likelihood ratio tests. All model residuals were tested for 
normality and no violations of model assumptions were found. Estimated hetero- 
zygosity from the linear mixed effect models and the associated standard error is 
plotted in Extended Data Fig. 4. Microsatellite data are available from the Dryad 
Digital Repository at http://dx.doi.org/10.5061/dryad.86750. 
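
For readers unfamiliar with the diversity statistics named above, the sketch below computes observed heterozygosity, expected heterozygosity and a simple allele count for one locus. It is a generic illustration and is not claimed to match Arlequin's implementation in detail; it assumes Nei's unbiased estimator for expected heterozygosity, and the function names and example genotypes are hypothetical.

```python
from collections import Counter


def observed_heterozygosity(genotypes) -> float:
    """Fraction of individuals carrying two different alleles at the locus."""
    return sum(1 for a, b in genotypes if a != b) / len(genotypes)


def expected_heterozygosity(genotypes) -> float:
    """Nei's unbiased gene diversity: n/(n-1) * (1 - sum of squared allele
    frequencies), where n is the number of gene copies sampled."""
    alleles = [allele for pair in genotypes for allele in pair]
    n = len(alleles)
    counts = Counter(alleles)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return (n / (n - 1)) * (1.0 - sum_sq)


def allele_count(genotypes) -> int:
    """Number of distinct alleles observed (no rarefaction applied)."""
    return len({allele for pair in genotypes for allele in pair})


if __name__ == "__main__":
    # Made-up microsatellite genotypes (allele sizes) for four individuals.
    locus = [(180, 180), (180, 184), (184, 184), (180, 184)]
    print(observed_heterozygosity(locus),
          expected_heterozygosity(locus),
          allele_count(locus))
```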

Code availability. Scripts of analyses and code used for figure production are 
available upon request from M.J.G.G.

Extended Data Figure 1 | Experimental rationale for purging and then
exposing mutation load. Having been changed by strong (+SS, red) versus
weak (−SS, blue) histories of sexual selection, while under equal influences of
natural selection (NS), variation in mutation load residing in the form of
recessive alleles is exposed via inbreeding. Inbreeding was enforced through
monogamous sib × sib pairings, also eliminating concurrent confounds of
interlocus sexual conflict. Populations with reduced mutation load as a result of
histories of strong sexual selection are predicted to resist extinction (survival, s)
and maintain fitness (f) under continuous inbreeding (i).

Extended Data Figure 2 | Experimental evolution protocols for regimes
A and B. Contrasting intensities of strong (red) versus weak (blue) sexual
selection were imposed upon each generation of adult reproduction, while
equalizing effective population size within a regime, and allowing full genetic
mixing within the replicate lines at the egg/larval/pupal stages. From the start,
each treatment was replicated to create three independent lines. Regime A
(a) applied contrasting sexual selection by varying adult operational sex ratio,
while regime B (b) enforced monogamy to compare against polyandry.

Extended Data Figure 3 | Extinction and fitness decline protocols.
Inbreeding in family lines was performed via sib × sib crosses for up to 20
generations across 3 years. To measure extinction (a), a family was considered 
extinct when it failed to produce offspring, or offspring were of the same sex 
(which occurred in only 9 out of 216 family lines, indicating no sex-specific pre- 
adult mortality by treatment). In regime A, extinction data were collected from 
28 initial families per line, three lines per sexual selection treatment, comparing 
both strong versus weak treatments (n = 168 total family lines). In regime B,
extinction data were collected from eight initial families per line, three lines per
sexual selection treatment, comparing both strong versus weak treatments
(n = 48 total family lines). To measure fitness decline (b), two additional sib ×
sib pairs per family per generation were bred to estimate reproductive fitness in
every generation by counting number of offspring produced (see Methods). In 
both regimes A and B, fitness data were collected from eight initial families per 
line, three lines per sexual selection treatment, and both strong versus weak 
treatment contrasts in each.

Extended Data Figure 4 | Estimated heterozygosity (±s.e.m.) does not
differ between experimental evolution sexual selection treatments within
regime A (left) and regime B (right). Linear mixed effect modelling showed
the estimated heterozygosity of the male-biased selection treatment (M: $H_{\mathrm{est}}$ =
0.312, t = 9.468) is not significantly different from that of female-biased (F: $H_{\mathrm{est}}$ =
0.318, t = 9.295, P = 0.863), but is significantly different from monogamous
and polyandrous treatments (Mo: $H_{\mathrm{est}}$ = 0.199, t = 6.453, P = 0.003; Po: $H_{\mathrm{est}}$ =
0.197, t = 6.397, P = 0.003). The estimated heterozygosities of monogamous
and polyandrous treatments are not significantly different (P = 0.956) (see
Methods).

Extended Data Figure 5 | Concordance between raw data and model fit in
extinction analyses. Survival curves of raw data (thick and dotted lines)
overlaid on model fit (shaded areas with mean curves and 95% confidence
intervals). Survival of families derived from strong (red, solid line) or weak
(blue, dotted line) sexual selection treatment histories differed: (a) regime A,
male-biased (red) versus female-biased (blue) sexual selection treatments;
(b) regime B, polyandrous (red) versus monogamous (blue); (c) regimes A and
B combined into a single analysis. See Fig. 1 and the main text for results of
statistical analyses, and Methods and Extended Data Figs 2 and 3 for details of
protocols, methods and experimental design.

Extended Data Figure 6 | Regime A. Boxplots of the relationships between
fitness and inbreeding generation for the male-biased (a and c) versus the 
female-biased (b and d) treatments. Curves show the predicted relationships 
between reproductive fitness and inbreeding generation from the GLMMs, and 
the narrow red and blue shadows show the 95% confidence intervals predicted
from the fixed effects. Horizontal bars indicate medians, boxes indicate
interquartile ranges, whiskers indicate minimum and maximum values, and
circles indicate outliers (values 1.5 times higher or lower than the first and third
quartiles, respectively). Comparison of a versus b identifies the difference in 
total fitness declines between strong versus weak sexual selection histories in 
regime A, while c versus d identifies the same difference in decline for fitness 
but only for the sibling pairs that produced at least some offspring (that is, 
omitting zero fitness values that may have resulted from a failure to mate). See 
Fig. 1, main text and Extended Data Table 1 for results of statistical analyses. 

Extended Data Figure 7 | Regime B. Boxplots of the relationships between
fitness and inbreeding generation for the polyandrous (a and c) versus the
monogamous (b and d) treatments. Curves show the predicted relationships
between reproductive fitness and inbreeding generation from the GLMMs, and
the narrow red and blue shadows show the 95% confidence intervals predicted
from the fixed effects. Horizontal bars indicate medians, boxes indicate
interquartile ranges, whiskers indicate minimum and maximum values, and
circles indicate outliers (values 1.5 times higher or lower than first and third
quartiles, respectively). Comparison of a versus b identifies the difference in
total fitness declines between strong versus weak sexual selection histories in
regime B, while c versus d identifies the same difference in decline for fitness but
only for the sibling pairs that produced at least some offspring (that is, omitting
zero fitness values that may have resulted from a failure to mate). See Fig. 1,
main text and Extended Data Table 1 for results of statistical analyses.

successfully inseminated 50 females on average (±s.e.m.). Six virgin females
were allocated to individual GA1 control stock males (n = 11) every 12 h for 7
days, providing males with 84 potential mates. Over this 1 week period
Data Fig. 2), males successfully inseminated and generated offspring from an
average of 50 females (see Methods).

Extended Data Table 1 | Fixed-effect parameter estimates from negative binomial GLMMs of the relationship between fitness and
generation of inbreeding for male-biased and female-biased treatments (regime A), and polyandrous and monogamous treatments
(regime B), and their statistical interactions. See Methods for details of replication and sample sizes.

Comparison                                 Term                                     Estimate  Std. Error  z value  Pr(>|z|)
Regime A: Male-biased vs. Female-biased    Intercept                                    5.02        0.17    28.23    <0.001
                                           Inbreeding                                  -0.63        0.03   -21.34    <0.001
                                           Treatment (Female- or Male-biased)          -0.91        0.24    -3.75    <0.001
                                           Inbreeding*Treatment (Male-biased)           0.42        0.03    13.82    <0.001
Regime B: Polyandrous vs. Monogamous       Intercept                                    4.48        0.23    19.39    <0.001
                                           Inbreeding                                  -0.53        0.03   -21.17    <0.001
                                           Treatment (Monogamy or Polyandry)           -0.63        0.32    -1.97     0.048
                                           Inbreeding*Treatment (Polyandry)             0.29        0.03    10.56    <0.001
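
As a reading aid, the fixed effects in Extended Data Table 1 can be converted into predicted fitness values. The sketch below is not the authors' analysis code. It assumes the log link that is customary for negative binomial models, and a 0/1 dummy coding in which the treatment named in the interaction row (male-biased, polyandry) is coded 1; the table states neither, and the function names are illustrative.

```python
import math

# Fixed-effect estimates copied from Extended Data Table 1 (linear-predictor scale).
REGIMES = {
    "A": {"intercept": 5.02, "inbreeding": -0.63, "treatment": -0.91, "interaction": 0.42},
    "B": {"intercept": 4.48, "inbreeding": -0.53, "treatment": -0.63, "interaction": 0.29},
}


def predicted_fitness(regime, generation, treated):
    """Expected fitness from the fixed effects only.

    Assumptions (not stated in the table): log link, and `treated` = True
    for the treatment named in the interaction row.
    """
    c = REGIMES[regime]
    t = 1 if treated else 0
    eta = (c["intercept"]
           + c["inbreeding"] * generation
           + c["treatment"] * t
           + c["interaction"] * generation * t)
    return math.exp(eta)


def per_generation_ratio(regime, treated):
    """Multiplicative change in expected fitness per generation of inbreeding."""
    c = REGIMES[regime]
    slope = c["inbreeding"] + (c["interaction"] if treated else 0.0)
    return math.exp(slope)
```

Under these assumptions a positive interaction estimate means a shallower decline: the slope on the log scale moves from -0.63 to -0.21 in regime A and from -0.53 to -0.24 in regime B.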

PPAR-α and glucocorticoid receptor synergize to
promote erythroid progenitor self-renewal


Hsiang-Ying Lee, Xiaofei Gao, M. Inmaculada Barrasa, Hu Li, Russell R. Elmes, Luanne L. Peters & Harvey F. Lodish


Many acute and chronic anaemias, including haemolysis, sepsis
and genetic bone marrow failure diseases such as Diamond-Blackfan
anaemia, are not treatable with erythropoietin (Epo),
because the colony-forming unit erythroid progenitors (CFU-Es)
that respond to Epo are either too few in number or are not
sensitive enough to Epo to maintain sufficient red blood cell
production. Treatment of these anaemias requires a drug that acts at an
earlier stage of red cell formation and enhances the formation of
Epo-sensitive CFU-E progenitors. Recently, we showed that
glucocorticoids specifically stimulate self-renewal of an early erythroid
progenitor, burst-forming unit erythroid (BFU-E), and increase
the production of terminally differentiated erythroid cells. Here
we show that activation of the peroxisome proliferator-activated
receptor α (PPAR-α) by the PPAR-α agonists GW7647 and
fenofibrate synergizes with the glucocorticoid receptor (GR) to promote
BFU-E self-renewal. Over time these agonists greatly increase
production of mature red blood cells in cultures of both mouse fetal
liver BFU-Es and mobilized human adult CD34+ peripheral blood
progenitors, with a new and effective culture system being used
for the human cells that generates normal enucleated reticulocytes.
Although Ppara−/− mice show no haematological difference from
wild-type mice in both normal and phenylhydrazine (PHZ)-induced
stress erythropoiesis, PPAR-α agonists facilitate recovery of
wild-type but not Ppara−/− mice from PHZ-induced acute haemolytic
anaemia. We also show that PPAR-α alleviates anaemia in a mouse
model of chronic anaemia. Finally, both in control and
corticosteroid-treated BFU-E cells, PPAR-α co-occupies many chromatin
sites with GR; when activated by PPAR-α agonists, additional
PPAR-α is recruited to GR-adjacent sites and presumably
facilitates GR-dependent BFU-E self-renewal. Our discovery of the role
of PPAR-α agonists in stimulating self-renewal of early erythroid
progenitor cells suggests that the clinically tested PPAR-α agonists
we used may improve the efficacy of corticosteroids in treating
Epo-resistant anaemias.

The therapeutic effect of glucocorticoids in treating Epo-resistant
anaemias such as Diamond-Blackfan anaemia (DBA) is well
documented, although steroid therapy has severe side effects that limit its
use. Given the physiological importance and attractive drug targets
of nuclear receptors (NRs), and in the hope of identifying other
small-molecule drugs that can either individually or in combination
with glucocorticoids expand BFU-E progenitors, we analysed the
expression of 49 NR genes during mouse BFU-E self-renewal promoted
by dexamethasone (DEX). Although the expression of most NR genes
was unchanged, seven were upregulated and three downregulated by
more than twofold (Supplementary Table 1). Among these, PPAR-α
changed the most: a fivefold induction (Extended Data Fig. 1a). While
most NR agonists and antagonists had no significant effects on
erythroid production in cultures of primary mouse BFU-Es, two PPAR-α
agonists, GW7647 and fenofibrate, synergized with DEX to increase the
output of erythroid cells (Supplementary Table 2 and Fig. 1a). In the
absence of corticosteroids, neither GW7647 nor fenofibrate affected
erythropoiesis. Fenofibrate is a US Food and Drug Administration
(FDA)-approved drug primarily used to treat hypercholesterolaemia
and hypertriglyceridaemia, and GW7647 is a PPAR-α agonist with
higher receptor binding affinity and specificity (half-maximum
effective concentration (EC50) = 6 nM).

When added together with DEX, either GW7647 or fenofibrate led 
to a ~150-fold increase in erythroblast production, 3-5-fold greater 
than DEX alone (Fig. 1a). At the end of the culture, virtually all of the 
cells were erythroblast cells (Extended Data Fig. 1b). Importantly, the 
expression of Ppara declines markedly from the BFU-E to the CFU-E 
stage, consistent with the decline of DNase I hypersensitivity of the 
Ppara promoter (Extended Data Fig. 1c, d).

Neither GW7647 nor fenofibrate was able to increase
erythroblast production from isolated mouse CFU-E cells, suggesting a
specific function of PPAR-α agonists on BFU-Es (Extended Data
Fig. 1e). GW7647 also synergized with lower doses of DEX in
promoting erythroid cell production (Fig. 1b). Importantly, the effects
of GW7647 and fenofibrate require the PPAR-α receptor, since
BFU-E cells isolated from Ppara−/− mice failed to respond to their
treatment (Fig. 1a).

Added together with DEX, 10 µM of either GW7647 or fenofibrate
significantly increased BFU-E colony numbers and maintained high
BFU-E numbers over one additional day relative to that achieved by
DEX alone (Fig. 1c). Concentrations of GW7647 as low as 100 nM
effectively synergized with 100 nM DEX in promoting BFU-E
self-renewal (Extended Data Fig. 1f).

Blood from Ppara−/− mice exhibits no haematological
abnormalities, indicating that PPAR-α is not necessary for normal erythropoiesis
(Supplementary Table 3). Consistent with this, the PPAR-α antagonist
GW6471 does not interfere with DEX-promoted BFU-E
self-renewal, indicating that DEX does not promote BFU-E self-renewal
through activating the transcriptional activity of PPAR-α (Extended
Data Fig. 1g).

Adding GW7647 increases total cell number by an additional fourfold,
yielding a 120,000-fold expansion in our newly developed synchronized
human CD34+ erythroid culture system (Supplementary Discussion
and Fig. 2a). In these cultures the numbers of BFU-Es and CFU-Es are
highest during days 5-8 (Fig. 2b and Extended Data Fig. 2a). Addition of
GW7647 with DEX increased total BFU-E numbers, which in turn
subsequently increased total CFU-E numbers in the culture. Similar to
mouse BFU-E cultures, GW7647 synergized with very low
concentrations of DEX in stimulating erythroid expansion of human CD34+ cells
(Extended Data Fig. 2b). Addition of DEX and GW7647 affects
expression of many key erythroid genes (Supplementary Discussion).
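
The expansion figures quoted for the human culture can be related to each other with a little arithmetic. The sketch below assumes that the "additional fourfold" is measured against the same culture without GW7647, so the roughly 30,000-fold baseline it returns is an inference rather than a number reported in the text; the function names are illustrative.

```python
import math


def baseline_fold(total_fold, extra_fold):
    """Fold expansion implied for the culture without the agonist (an inference)."""
    return total_fold / extra_fold


def population_doublings(fold):
    """Net number of population doublings that corresponds to a fold expansion."""
    return math.log2(fold)


without_agonist = baseline_fold(120_000, 4)
doublings_with_agonist = population_doublings(120_000)
```

A fourfold gain is two extra net doublings, which is why a modest change in progenitor self-renewal early in the culture can show up as a large difference in final cell number.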

As expected, knocking down PPAR-α abrogated the ability of
GW7647 to stimulate production of BFU-E cells or the total number
of erythroid cells at the end of the culture (Fig. 2c and Extended
Data Fig. 2c). Consistent with the absence of a role for PPAR-α in
normal haematopoiesis, knocking down PPAR-α in human CD34+
cells did not affect the ability of DEX to stimulate BFU-E production or
production of erythroid cells. Thus, in the human as well as in the
mouse, only ligand-activated, not unactivated, PPAR-α synergizes
with GR to promote BFU-E self-renewal during erythroid
differentiation. We note that our cultures are highly synchronous on the basis of
cell surface marker expression (Extended Data Fig. 2e) as well as
morphology (Fig. 2d). Neither DEX nor GW7647 had any effects on
terminal differentiation or enucleation (Extended Data Fig. 2h).
GW7647 significantly increased the numbers of both BFU-Es and
CFU-Es in our CD34+ cell culture system after RPS19 knockdown, which
recapitulates RPS19 haploinsufficiency in DBA patients (Extended
Data Fig. 3a). Addition of GW7647 substantially increased the fraction of
CD71+ cells at day 9 of culture after RPS19 knockdown as well as total
cell numbers and the fraction of CD235a+ cells at day 21 (Extended Data
Fig. 3b, c). Taken together, our data suggest that GW7647 facilitates
BFU-E self-renewal and production of immature erythroid progenitors
in both wild-type and RPS19-knockdown human cells.

Figure 1 | PPAR-α signalling synergizes with GR to promote self-renewal of
mouse BFU-E erythroid progenitors. a, Wild-type (WT; left) and Ppara−/−
(right) BFU-E cells from embryonic day (E)14.5 mouse fetal livers were isolated
and cultured in serum-free erythroid liquid expansion (SFELE) medium with
the indicated treatment. Cell numbers were counted every 3 days. b, Mouse
BFU-E cells were cultured in DEX + 10 µM GW7647 as indicated. Total
erythroid cell numbers were counted at day 9. c, Colony-forming assays were
conducted to determine BFU-E colony numbers from 100 mouse BFU-E cells
cultured under the indicated conditions. Colony-forming assays were
performed at 24-h intervals. Data are mean ± s.d. from three biological
replicates; *P < 0.05, **P < 0.01, ***P < 0.001, Student's t-test.

Figure 2 | Activation of PPAR-α signalling increased erythroid cell
expansion in the ex vivo human CD34+ erythroid culture system. a, Human
CD34+ cells were cultured as described in Methods. Total cell numbers were
quantified. b, BFU-E colony numbers were quantified by plating 1,000 cells at
various time points during days 0-9 of the human CD34+ erythroid culture on
methylcellulose. c, BFU-E numbers (left) or total cell number (right) from
control or PPAR-α-knockdown human CD34+ cells cultured under the
indicated condition were counted. shRNA, short hairpin RNA. d,
Benzidine-Giemsa staining demonstrating cell morphology of the ex vivo human CD34+
erythroid differentiation system. *P < 0.05, **P < 0.01, Student's t-test.
Data are mean ± s.d. from three biological replicates.

We next tested the function of GW7647 in a PHZ-induced
haemolytic anaemia mouse model in which the endogenous corticosteroid
level becomes markedly increased. Wild-type mice were treated with
either dimethylsulfoxide (DMSO) or GW7647 for 3 days followed by
treatment with PHZ; they were then injected with DMSO or GW7647
for another 7 days (Supplementary Discussion and Extended Data Fig.
4a). Treatment with GW7647 resulted in significantly higher levels of
haemoglobin, red blood cell numbers, and haematocrit after PHZ
injection than in control mice (Fig. 3a). In contrast, white
blood cell counts were similar in the two groups (data not shown).
Importantly, the function of GW7647 during stress erythropoiesis was
dependent on PPAR-α, as Ppara−/− mice failed to respond to
GW7647 treatment (Extended Data Fig. 4b).

We also tested the ability of PPAR-α agonists to stimulate red cell
production in a mouse model of chronic anaemia, 'neonatal anaemia'
(Nan) mice (Supplementary Discussion). While BFU-E numbers in
spleens of Nan/+ mutant mice are similar to or slightly higher than
those in wild-type mice, the average number of BFU-Es in the spleens of
GW7647-injected Nan/+ mutant mice is higher than that in
untreated mice (Extended Data Fig. 5b). Importantly, GW7647
injection increased haemoglobin level, haematocrits and red blood cell
numbers in these anaemic mice (Fig. 3b), suggesting that the PPAR-α
agonist alleviates anaemia in Nan/+ mutant mice by increasing
BFU-E numbers in the spleen. GW7647 does not have any significant
effects on either platelet or white blood cell numbers in peripheral
blood from Nan/+ mice (Extended Data Fig. 5c).

Figure 3 | PPAR-α agonist GW7647 is effective in vivo to alleviate anaemic
symptoms. a, Wild-type mice were pretreated with DMSO or 100 µg kg−1
GW7647 for 3 days (days −3 to −1) before PHZ injection on day 0. Mice
were subjected to daily DMSO or GW7647 injections during days 0-6.
Haemoglobin (HGB), red blood cells (RBC) and haematocrit (HCT) were
measured in wild-type mice on the days indicated. n = 6. b, Nan/+ mutant
mice were injected with DMSO or GW7647 (100 µg kg−1) for 18 days. Each dot
represents one mouse. *P < 0.05, **P < 0.01, ***P < 0.001, Student's t-test.

To understand the molecular mechanism by which PPAR-α
synergizes with GR to promote BFU-E self-renewal, we conducted
chromatin immunoprecipitation followed by sequencing (ChIP-seq)
analysis to interrogate the genome-wide chromatin occupancy of GR
and PPAR-α in mouse BFU-E cells. Compared to untreated cells, GR
and PPAR-α occupancy were both induced upon DEX treatment
alone; addition of GW7647 further enhanced PPAR-α but not GR
occupancy (Fig. 4a). These results indicate that GW7647 enhances
the recruitment of PPAR-α to GR-binding sites in BFU-Es. In
BFU-Es co-treated with GW7647 and DEX, our ChIP-seq detected
1,058 GR and 1,623 PPAR-α peaks. GR and PPAR-α both
predominantly occupied distal intergenic (>63%) and intronic chromatin sites
(>25%) (Extended Data Fig. 6a). Notably, in 719 peaks, GR and PPAR-α
localize in close proximity (Extended Data Fig. 6b); 67.9% of total GR
peaks and 44.3% of PPAR-α peaks co-localize. The DNA sequences
underlying these overlapping peaks are enriched for DNA-binding
motifs for several transcription factors including PU.1, YY1, Smad3,
Tal1, Klf4, Hif-2α and Myb (Extended Data Fig. 6c); most of these
transcription factors are known to have crucial roles in stem-cell
self-renewal. In addition, co-treatment with DEX and GW7647 also
upregulates many genes critical for stem-cell self-renewal (Supplementary
Discussion and Extended Data Figs 7, 8).
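
The co-localization percentages follow directly from the peak counts reported in this paragraph. The short check below assumes that each of the 719 shared regions contains exactly one GR peak and one PPAR-α peak, which the text does not state explicitly; the function name is illustrative.

```python
def overlap_percentages(shared, gr_total, ppara_total):
    """Percentage of GR peaks and of PPAR-alpha peaks that lie in shared regions."""
    return 100.0 * shared / gr_total, 100.0 * shared / ppara_total


# Counts taken from the text: 719 shared regions, 1,058 GR peaks, 1,623 PPAR-alpha peaks.
gr_pct, ppara_pct = overlap_percentages(719, 1058, 1623)
```

The GR value comes out at just under 68%, which matches the quoted 67.9% if the figure was truncated rather than rounded, and the PPAR-α value rounds to the quoted 44.3%.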

DEX treatment of BFU-E cells leads to a slight increase in PPAR-α
protein levels, and this is further increased to 1.6 times that of control
cells by treatment with DEX and GW7647 (Fig. 4c). In contrast, the GR
protein level is not altered in BFU-Es under these conditions. Consistent
with these observations, addition of DEX to BFU-E cells induced the
binding of both GR and PPAR-α to a chromatin site ~5 kb upstream of
the transcription start site of Ppara, presumably part of a Ppara gene
enhancer, and the binding of PPAR-α to this chromatin site was further
enhanced by GW7647 addition (Fig. 4b). Our data suggest that there is a
positive autoregulatory feedback loop of PPAR-α expression during
BFU-E self-renewal promoted by DEX and GW7647.

Co-immunoprecipitation experiments demonstrated an interaction
of GR and PPAR-α only in BFU-E cells treated with DEX with or
without GW7647 (Fig. 4d). While addition of DEX to cultures of
BFU-E cells stimulated PPAR-α binding to the GR, interactions between
PPAR-α and GR were more pronounced after treatment with both DEX
and GW7647. These results extend our ChIP-seq data, indicating that
there is a physical interaction between GR and PPAR-α and probably
other proteins that underlies DEX- and GW7647-enhanced BFU-E
self-renewal (Supplementary Discussion and Extended Data Fig. 9).

Figure 4 | PPAR-α and glucocorticoid signalling pathways regulate a
common target gene ensemble. a, Density maps of GR and PPAR-α ChIP-seq
signals in mouse BFU-E cells. GR-binding peaks in DEX and GW7647
co-treated samples were used as the reference to search for corresponding
ChIP-seq signals of PPAR-α. b, GR and PPAR-α occupancy across the Ppara locus in
BFU-E cells treated as indicated. c, Left, protein level of GR, PPAR-α and
β-actin in mouse BFU-E cells treated as indicated. Right, densitometry
quantification of GR and PPAR-α protein expression levels normalized to
β-actin. Experiments were repeated three times. *P < 0.05, **P < 0.01,
Student's t-test. d, Co-immunoprecipitation (IP) demonstrating interaction
between GR and PPAR-α in mouse BFU-E cells.

Given that little is known concerning endogenous PPAR-α ligands,
the precise role of PPAR-α in haematopoiesis, and erythropoiesis in
particular, remains elusive. Nonetheless, our finding that agonists of
PPAR-α lead to enhanced binding of PPAR-α to many chromatin sites
and to an increase in corticosteroid-induced BFU-E self-renewal and
erythroid expansion points to a previously unappreciated role for this
nuclear receptor in haematopoiesis (Extended Data Fig. 10). The
function of PPAR-α has been mainly studied in nutrient metabolism and
energy homeostasis, and fenofibrate is already an FDA-approved drug
for dyslipidaemia treatment. Given that there is a very limited
number of drugs that can be used to treat Epo-resistant anaemias, the
development of new drugs or repurposing current drugs to treat
these diseases is challenging. Our surprising discovery suggests a novel
function of PPAR-α in self-renewal of early committed erythroid
progenitors, which could potentially lead to new therapeutics to treat
Epo-resistant anaemias such as DBA.

METHODS


Reagents. Chemicals were obtained from Sigma. Human peripheral blood
granulocyte-colony stimulating factor (G-CSF)-mobilized haematopoietic
stem/progenitor cells enriched for CD34+ were purchased from the Fred Hutchinson
Cancer Research Center (FHCRC). StemSpan SFEM, CC100 cytokine cocktail,
fetal bovine serum and BSA were purchased from STEMCELL Technologies. Holo
human transferrin was from Sigma. Recombinant human and murine stem cell
factor (rhSCF, rmSCF) and interleukin-3 (rhIL-3, rmIL-3), recombinant murine
interleukin-6 (rmIL-6) and recombinant murine insulin like growth factor-1
(rmIGF-1) were from Peprotech. Recombinant human erythropoietin (rhEpo)
was from Amgen. Antibodies used were as follows. Santa Cruz: GR: H300
(sc-8992), M-20 (sc-1004); PPAR-α: H-98 (sc-9000); β-actin: N-21 (sc-130656). RPS19
antibody is from Abcam (ab57643). FACS antibodies are from eBioscience:
anti-human c-kit PE (#12-1178), anti-human CD71 FITC (#11-0719), anti-human
CD235a APC (#17-9987), anti-mouse CD71 PE (#12-0711), and anti-mouse
Ter119 APC (#17-5921). Real-time PCR primers (mouse): Hbb-F,
5'-TTTAACGATGGCCTGAATCACTT-3'; Hbb-R, 5'-CAGCACAATCACGATCATATTGC-3';
Slc4a1-F, 5'-GGACAGATAGCATATAGAGACCTAACCA-3'; Slc4a1-R,
5'-CGTAGTCTGTGGCTGTTTGCTC-3'; Egln2-F, 5'-CTGGGCAACTACGTCATCAAT-3';
Egln2-R, 5'-CCGCCATGCACCTTAACATC-3'; Hmgcs2-F,
5'-GAAGAGAGCGATGCAGGAAAC-3'; Hmgcs2-R, 5'-GTCCACATATTGGGCTGGAAA-3';
Kit-F, 5'-GITTCTGCTCCTACTGCTTCGC-3'; Kit-R,
5'-TAACAGCCTAATCTCGTCGCC-3'.

shRNA sequences: human PPARA shRNA-1, 5'-GGAGTTTATGAGGCCATATTC-3';
human PPARA shRNA-2, 5'-GCTTTACGGAATACCAGTATT-3';
mouse Pu.1 shRNA, 5'-CCATGTCCACAACAACGAGTT-3'. ChIP-PCR
primers: Kit ChIP-PCR-F, 5'-GTCACAGCCACCAGAGAGAG-3'; Kit
ChIP-PCR-R, 5'-TGGCAATGTTAAGAAGTGGTGG-3'; Ppara ChIP-PCR-F,
5'-CCAGGGCTACACAGAGAAAC-3'; Ppara ChIP-PCR-R,
5'-TTGGCACAACTCTCAGGCTA-3'.

RPS19 shRNA is described in a previously published study.

Ex vivo human CD34+ erythroid culture. Our human CD34+ cell erythroid
differentiation method is composed of 4 phases over 21 days: Expansion (day 0-4),
differentiation (Dif) I (day 5-9), II (day 10-13), and III (day 14-21). Cells are
thawed according to the FHCRC protocol, and cultured in expansion medium
(StemSpan SFEM, CC100 cytokine cocktail and 2% penicillin-streptomycin) at
10° cells ml−1 from day 0 to 4. After expansion, cells are subsequently cultured in
Iscove's modified Dulbecco's medium (IMDM)-based erythroid differentiation
medium supplemented with different cytokines in Dif I, II and III. The medium
base for all three differentiation phases comprises: IMDM, 15% FBS, 2 mM
glutamine, 1% BSA, 500 µg ml−1 holo human transferrin, 10 µg ml−1 recombinant
human insulin and 2% penicillin-streptomycin. On days 5-9, cells are cultured in
Dif I medium, which contains erythroid medium base, 1 µM DEX, 1 µM
β-oestradiol, 5 ng ml−1 rhIL-3, 100 ng ml−1 rhSCF and 6 U ml−1 rhEpo. On days 10-13,
cells are grown in Dif II medium containing erythroid medium base, 50 ng ml−1
rhSCF and 6 U ml−1 rhEpo. On days 14-21, cells are cultured in
fibronectin-coated plates in Dif III medium, which is erythroid medium base supplemented
with 2 U ml−1 rhEpo. GW7647 was added in both Dif I (100 nM) and Dif II (10
nM) when indicated. Cell numbers re-seeded in the beginning of Dif I, II and III
were 10°, 2 × 10°, and 3 × 10° per ml, respectively.
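
The four-phase schedule can be written down as a small lookup table, which makes it easy to check which medium applies on a given culture day. This is only an illustrative sketch: the class and function names are hypothetical, the supplement lists name the phase-specific additions from the text (the shared erythroid medium base is omitted), and cell densities are left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Phase:
    name: str
    first_day: int
    last_day: int
    additions: tuple               # phase-specific supplements named in the Methods
    gw7647_nm: Optional[int]       # GW7647 concentration (nM) when the agonist is used


PHASES = (
    Phase("Expansion", 0, 4, ("StemSpan SFEM", "CC100 cytokine cocktail"), None),
    Phase("Dif I", 5, 9, ("DEX", "beta-oestradiol", "rhIL-3", "rhSCF", "rhEpo"), 100),
    Phase("Dif II", 10, 13, ("rhSCF", "rhEpo"), 10),
    Phase("Dif III", 14, 21, ("rhEpo",), None),
)


def phase_for_day(day):
    """Return the culture phase that covers the given day (0 to 21)."""
    for phase in PHASES:
        if phase.first_day <= day <= phase.last_day:
            return phase
    raise ValueError(f"day {day} is outside the 21-day protocol")
```

For example, `phase_for_day(7)` returns the Dif I entry, the only phase in which DEX is present.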

Mouse fetal liver BFU-E and CFU-E isolation and culture. Purification of
BFU-E and CFU-E cells from murine fetal livers was performed as described
previously. Briefly, fetal liver erythroid cells were isolated between E14.5 and
E15.5. A pure erythroid progenitor population containing more than 90% of
BFU-E and CFU-E cells was obtained through a negative magnetic bead selection
for lineage markers including Ter119, B220, Mac-1, CD3, Gr-1, Sca-1,
CD16/CD32, CD41, CD34 and positive selection for c-Kit. Afterwards c-Kit+ BFU-E
(CD71 10% low) and CFU-E (CD71 20% high) cells were separated by flow
cytometry. Purified BFU-E or CFU-E cells were seeded in serum-free erythroid liquid
expansion medium (SFELE) (StemSpan SFEM with 100 ng ml−1 rmSCF, 40 ng
ml−1 rmIGF-1, and 2 U ml−1 rhEpo) alone, or SFELE containing 100 nM DEX +
10 µM GW7647. Erythroblasts produced from purified BFU-E and CFU-E were
analysed over time by FACS and benzidine-Giemsa staining as described.

For Pu.1 knockdown, primary BFU-Es purified from mouse E14.5 fetal liver
were incubated with virus encoding a shRNA for LacZ or mouse Pu.1 at 37 °C
overnight. After incubation, cells were cultured in SFELE medium with or without
DEX and/or GW7647.

Colony-forming assay. Murine BFU-E colony-forming assays were performed in
MethoCult M3234 (StemCell Technologies) containing 10 U ml−1 rhEpo, 20 ng
ml−1 rmIL-3, 20 ng ml−1 rmIL-6 and 50 ng ml−1 rmSCF, with or without 100 nM
DEX + 10 µM GW7647. For CFU-E colony-forming assays, cells were cultured
in MethoCult M3234 containing 10 U ml−1 rhEpo, with or without 100 nM
DEX + 10 µM GW7647. The number of CFU-E or BFU-E colonies was scored
after 3 days or 7 days in culture, respectively.

For human colony-forming assays, cells were plated in MethoCult H4034
Optimum (StemCell Technologies) without additional cytokines. Cells were
cultured for 12-14 days before BFU-E and CFU-E colonies were scored.
In vivo mice experiments to test PPAR-α agonists. 129/SvImJ (wild-type),
129S4/SvJae (wild-type), or 129S4/SvJae-Ppara<tm1Gonz>/J female mice (The
Jackson Laboratory), 6-8 weeks of age, randomized by weight, were pretreated
with GW7647 (100 µg kg−1) for 3 days (days −3 to −1). On day 0, the mice were
injected with phenylhydrazine (PHZ) (60 mg kg−1). GW7647 injection was
continued during days 0-6. Whole blood samples were collected daily on days
1-5 and on days 7 and 9 for complete blood count (CBC) analyses.

Four- to six-week-old Nan/+ mutant mice (officially designated as Klf1<Nan>,
http://www.informatics.jax.org/allele/MGI:1861107), obtained from the Jackson
Laboratory, were injected with GW7647 (100 µg kg−1) for 18 days. Mice were
randomized by weight. Whole blood samples were collected every 3 days for
CBC analyses.

All mouse procedures were approved by the Animal Care and Use Committees
of the Massachusetts Institute of Technology.

Loss-of-function assay in human CD34+ erythroid culture system. The
lentiviral backbone vector pLKO.1 and packaging plasmids were
transfected into 293T cells. Supernatants containing viral particles were harvested at 48
and 72 h. Primary human CD34+ haematopoietic cells were transduced with
lentivirus 1 day after thawing in the presence of 2 µg ml−1 polybrene
(Sigma). Twenty-four hours after viral transduction, cells were selected by growing
in culture medium containing 1 µg ml−1 puromycin (Sigma) for 2 days.

For RPS19 knockdown, 5 × 10° CD34+ cells at day 1 of culture were infected with
lentivirus encoding GFP and a shRNA targeting human RPS19 (ref. 21) or a
scrambled shRNA. Cells were then treated with DMSO or 0.01 µM, 0.1 µM or 1
µM GW7647. After 48 h, GFP-positive cells were isolated by flow cytometry and
returned to culture. Colony-forming assays were conducted at day 6. CD71
expression was analysed at day 9 and CD235a expression was analysed at day 21
to determine the percentage of erythroid cells. Total cell numbers were also
counted at the end of each differentiation stage.

ChIP-seq and de novo motif discovery. Mouse ChIP-seq experiments in BFU-Es
were conducted as described before. 10”? mouse BFU-E cells purified from
E14.5 fetal liver with or without treatment were used per immunoprecipitation.
Besides specific antibodies, we included species-matched IgG as control, and
species-matched IgG yielded little to no signals. Purified DNA was prepared for
sequencing according to a modified version of the Solexa genomic DNA protocol. ChIP-seq
fragments as well as inputs were barcoded and sequenced on an Illumina HiSeq
sequencer. Approximately 30 million 40-nucleotide-long single reads per sample
were obtained. Adapters were removed and reads shorter than 20 nucleotides were
discarded. Reads were mapped with Bowtie1 (ref. 25). Peaks were called with MACS
1.4 using the corresponding input for each sample and "mfold" set to 5,30 (ref. 26).
We selected peaks with fold enrichment ≥ 10. Plots showing density of reads around
the peak summits were done with ngsplots. For de novo motif discovery, Homer was
used with default setting to find motifs in GR and PPAR-α overlapping peaks.
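
The two numeric filters in this paragraph (a minimum read length of 20 nucleotides after adapter removal, and a minimum fold enrichment of 10 for called peaks) can be sketched as follows. This is a minimal illustration, not the pipeline used in the study: the record layout, field names and example values are hypothetical, and real MACS output would first need to be parsed into this form.

```python
MIN_READ_LENGTH = 20        # reads shorter than this were discarded
MIN_FOLD_ENRICHMENT = 10.0  # peaks below this enrichment over input were dropped


def keep_read(sequence, min_length=MIN_READ_LENGTH):
    """True if an adapter-trimmed read is long enough to be kept for mapping."""
    return len(sequence) >= min_length


def select_peaks(peaks, min_fold=MIN_FOLD_ENRICHMENT):
    """Keep peaks whose fold enrichment over input meets the threshold."""
    return [p for p in peaks if p["fold_enrichment"] >= min_fold]


# Hypothetical peak records, for illustration only.
example_peaks = [
    {"chrom": "chr1", "start": 100, "end": 300, "fold_enrichment": 9.9},
    {"chrom": "chr1", "start": 5000, "end": 5250, "fold_enrichment": 10.0},
    {"chrom": "chr2", "start": 700, "end": 950, "fold_enrichment": 25.3},
]
selected = select_peaks(example_peaks)
```

The threshold is applied inclusively here, on the reading that the garbled comparison in the source was "greater than or equal to".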
RNA-seq. BFU-E cells from E14.5 mouse fetal livers were isolated as described.
Total RNA was purified from the mouse BFU-Es. Samples for paired-end
mRNA-seq were prepared using the Solexa kit according to the manufacturer's instructions.
RNA samples from two replicates of cells untreated, or treated with DEX + GW7647
for 12 h, were sequenced on an Illumina HiSeq sequencer. Around 50 million
100-nucleotide paired-end reads were obtained from each sample. Reads were trimmed
to remove low quality reads using FASTQ quality trimmer with a quality threshold
of 20 (-t 20) and a minimum length to keep of 25 nucleotides (-l 25). Reads that
still had both pairs after the trimming step were mapped with TopHat using
gene models from ENSEMBL Genes 67, Mus musculus genes NCBIM37
(Mus_musculus.NCBIM37.67). The number of reads mapped to each gene was
obtained with HTseq-count. Differential expression was assayed using DE-seq.

For each of the 719 peaks bound by both GR and PPAR-α in the presence of
GW7647 and DEX, we found the closest gene using the closestBed tool. We
filtered out any gene at a distance greater than 10 kb from a peak.
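
The peak-to-gene assignment can be illustrated with a small pure-Python sketch. It stands in for the closestBed step rather than reproducing it: the interval convention (half-open coordinates), the record fields and the example genes are all assumptions made for illustration, and edge cases such as ties or touching features may be handled differently by the real tool.

```python
MAX_DISTANCE_BP = 10_000  # genes further than 10 kb from a peak are discarded


def interval_gap(a_start, a_end, b_start, b_end):
    """Gap in bp between two half-open intervals; 0 if they overlap or touch."""
    if a_end <= b_start:
        return b_start - a_end
    if b_end <= a_start:
        return a_start - b_end
    return 0


def closest_gene(peak, genes, max_distance=MAX_DISTANCE_BP):
    """Return (gene_name, distance) for the nearest gene on the same chromosome,
    or None if no gene lies within max_distance of the peak."""
    best = None
    for gene in genes:
        if gene["chrom"] != peak["chrom"]:
            continue
        d = interval_gap(peak["start"], peak["end"], gene["start"], gene["end"])
        if best is None or d < best[1]:
            best = (gene["name"], d)
    if best is not None and best[1] <= max_distance:
        return best
    return None


# Hypothetical annotation, for illustration only.
example_genes = [
    {"name": "geneA", "chrom": "chr1", "start": 5_000, "end": 9_000},
    {"name": "geneB", "chrom": "chr1", "start": 50_000, "end": 60_000},
    {"name": "geneC", "chrom": "chr2", "start": 1_000, "end": 2_000},
]
near = closest_gene({"chrom": "chr1", "start": 1_000, "end": 1_200}, example_genes)
far = closest_gene({"chrom": "chr1", "start": 100_000, "end": 100_200}, example_genes)
```

In this example the first peak is assigned to geneA at a distance of 3,800 bp, while the second peak has no gene within 10 kb and is left unassigned.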
Immunoprecipitation. Mouse BFU-Es were isolated from E14.5 mouse fetal
livers. Cells were cultured in SFELE medium alone or treated with 100 nM DEX
with or without 10 µM GW7647 for 12 h. Whole-cell lysates were prepared from
4 × 10’ cells using RIPA lysis buffer (sc-24948, Santa Cruz Biotechnology) with
protease inhibitor cocktail. The lysates were pre-cleared by incubating with protein
A-Sepharose for 1 h at 4 °C and centrifugation. The supernatant was
immunoprecipitated with 1 µg rabbit IgG or anti-GR antibody overnight at 4 °C. Immune
complexes were collected by incubation with protein A-Sepharose for 4 h at 4 °C
and washed five times at 4 °C with lysis buffer. The immune complexes adsorbed
to the beads were centrifuged and the supernatant was removed. Fifty microlitres
of 1× loading buffer was added to the samples, which were boiled at 95 °C for 5 min.
Proteins were resolved by SDS-PAGE and immunoblotted with GR and PPAR-α
antibodies.

Quantitative real-time RT-PCR. Total RNA from mouse BFU-Es was purified
with TRIzol (Invitrogen). cDNA was prepared from 1 µg RNA. Reaction mixtures
(15 µl) contained 2.0 µl of cDNA, 7.5 µl of SYBR green master mix (Applied
Biosystems) and appropriate primers. Product was monitored by SYBR green
fluorescence. Control reactions lacking RT yielded little to no signal. Relative
expression levels were determined by the ΔΔCt method and were normalized
to 18S rRNA expression.
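
For readers unfamiliar with the ΔΔCt calculation, a minimal sketch follows. It uses the standard 2^(−ΔΔCt) form, which assumes close to 100% amplification efficiency for both the target and the reference (here 18S rRNA); the function name and the Ct values in the example are hypothetical.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene in a sample relative to a control.

    Each Ct is first normalized to the reference gene (delta Ct), the control
    delta Ct is subtracted (delta-delta Ct), and the result is converted to a
    fold change as 2 ** (-ddCt). Assumes ~100% amplification efficiency.
    """
    d_ct_sample = ct_target_sample - ct_ref_sample
    d_ct_control = ct_target_control - ct_ref_control
    dd_ct = d_ct_sample - d_ct_control
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values: the target crosses threshold two cycles earlier in the
# treated sample while the reference gene is unchanged.
fold = relative_expression(24.0, 10.0, 26.0, 10.0)
```

A two-cycle shift at constant reference signal corresponds to a fourfold difference in expression.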

Extended Data Figure 1 | PPAR-α agonist GW7647 does not have adverse
effects on erythroid differentiation and has no effects on CFU-E cells.
a, Gene expression changes of nuclear receptors in BFU-E cells from RNA-seq
results published previously. b, Flow cytometry analyses of CD71 and Ter119
markers to demonstrate erythroid differentiation of mouse BFU-E cells after 9
days of culture with the indicated additions. c, Ppara gene expression in BFU-E,
CFU-E and Ter119+ erythroblasts. BFU-E, CFU-E and Ter119+ erythroid cells
were isolated from E14.5 mouse fetal livers as described. Total RNA was
purified for quantitative PCR analysis. Ppara gene expression was normalized
to mouse 18S rRNA in different stages. Error bars represent mean ± s.d. from
three independent experiments. d, DNase I hypersensitivity (HS) analysis at
Ppara promoter region in different mouse cells from Encode. e, Production of
mouse erythroblasts from isolated CFU-E cells. Wild-type mouse CFU-E
cells from E14.5 fetal livers were untreated (black line) or treated with DEX
(blue line), GW7647 (red line) or fenofibrate (green line). Error bars represent
mean ± s.d. from three independent experiments. f, Colony-forming assays
were conducted at 48 h after compound treatment to determine BFU-E colony
numbers from 100 mouse BFU-E cells cultured under the indicated conditions.
*P < 0.05, Student's t-test. Error bars represent mean ± s.d. from three
independent experiments. g, At day 3, BFU-E colony numbers from 100
purified mouse BFU-E cells were quantified by colony forming assays. 100
purified mouse BFU-E cells were untreated or treated with DEX alone or DEX
in combination with agonists or antagonists targeting PPAR receptors (α, γ or
β). BFU-E colonies were quantified after 8 days in culture. *P < 0.05, **P <
0.01, Student's t-test. Error bars represent mean ± s.d. from three independent
experiments. h, Real-time PCR analysis of gene expression in DEX-treated and
DEX+GW7647-treated wild-type or Ppara−/− mouse BFU-E cells. *P < 0.05,
***P < 0.001, Student's t-test. Error bars represent mean ± s.d. from three
independent experiments.
Extended Data Figure 2 | Human CD34+ erythroid differentiation system. 
a, Total CFU-E colonies formed during days 0-9. CFU-E colony numbers were 
quantified by plating 1,000 cells from various time points during days 0-9 of the 
human CD34+ erythroid culture on methylcellulose. CFU-E colonies were 
quantified after 12-14 days. Total CFU-E colony numbers in culture under 
conditions without GW7647 (black line) or with GW7647 (red line) were 
calculated using the total cell numbers at corresponding time points in Fig. 2a. 
b, Human CD34+ cells were treated at day 1 with 100 nM GW7647 with or 
without DEX at the concentration indicated in the figure. At day 6, total cell 
numbers were counted and cells were collected for BFU-E colony assays. 
c, Protein expression of PPAR-α demonstrating shRNA knockdown efficiency 
via lentiviral transduction. LacZ shRNA is used as a control. shRNA-1 and -2 
are both specific for PPAR-α. shRNA-2 has higher knockdown efficiency. 
d, Cell pellets of 1 million cells demonstrating haemoglobin accumulation 
during the differentiation process. e, Flow cytometry analyses of erythroid 
markers during the 21-day human CD34+ erythroid culture. Top, c-KIT versus 
CD235a; middle, CD71 versus CD235a. Note the sequential induction of c-KIT, 
CD71 and CD235a, as well as the sequential downregulation of c-KIT and CD71. 
Bottom, enucleated reticulocytes are CD235a+ Hoechst−, nuclei are 
CD235a− Hoechst+, and nucleated erythroblasts are CD235a+ Hoechst+. 
Enucleation rate is 32.6/(32.6+37.7) × 100% = 46.4%. f, Summary of 
high-performance liquid chromatography (HPLC) results using haemolysates of 
cultured reticulocytes and normal human RBCs (control). Top, total protein 
composition of haemolysates. Bottom, haemoglobin composition of 
haemolysates. Cultured reticulocytes contain more than 90% of adult globins. 
g, Size measurement of enucleated reticulocytes by both diameter and area. 
Scale bar, 10 μm. h, Benzidine-Giemsa staining of human reticulocytes 
cultured with or without GW7647. Scale bar, 12 μm. *P < 0.05, Student’s t-test. 
Error bars represent mean ± s.d. from three independent experiments. 
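The enucleation rate quoted in panel e can be reproduced with a short calculation. This is a minimal sketch: the function name is hypothetical, and it assumes that 32.6 and 37.7 are the gated percentages of enucleated reticulocytes and nucleated erythroblasts, respectively, as the formula in the legend implies.

```python
def enucleation_rate(reticulocyte_pct, nucleated_pct):
    """Share of CD235a-positive cells that have expelled their nucleus, in percent.

    Assumes both inputs are gate percentages from the same flow cytometry plot.
    """
    return reticulocyte_pct / (reticulocyte_pct + nucleated_pct) * 100


# Values taken from the legend of panel e
rate = enucleation_rate(32.6, 37.7)
print(f"Enucleation rate: {rate:.1f}%")
```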
Extended Data Figure 3 | GW7647 increases erythroid progenitors and 
CD235a+ cells in RPS19-knockdown human progenitor cells. a, Human 
CD34+ haematopoietic progenitors were transduced with lentivirus encoding 
GFP and either a scrambled shRNA or an shRNA targeting RPS19. Then 
transduced cells were treated with or without 100 nM GW7647. After 48 h, 
GFP+ cells were sorted by FACS and plated for BFU-E and CFU-E 
colony-forming assays. RPS19 knockdown efficiency is shown at the bottom. M.W., 
molecular weight. *P < 0.05, Student’s t-test. Error bars represent mean ± s.d. 
from three independent experiments. b, Sorted GFP+ cells were returned to 
culture with the indicated concentration of GW7647. Top, percentage of 
CD71+ cells at day 9 was determined by FACS. Bottom, percentage of 
CD235a+ cells at day 21 was determined by FACS. *P < 0.05, Student’s t-test. 
Error bars represent mean ± s.d. from three independent experiments. c, Total 
cell numbers generated from one GFP-positive cell at the indicated times of 
culture. *P < 0.05, Student’s t-test. Error bars represent mean ± s.d. from three 
independent experiments. 
Extended Data Figure 4 | GW7647 improves anaemia in two mouse models 
of anaemia. a, Experimental scheme for PHZ-induced haemolytic anaemia, 
used also in Fig. 3a. Wild-type or Ppara−/− mice were pretreated with DMSO 
(control) or GW7647 (100 μg kg−1) for 3 days (days −3 to −1) before PHZ 
injection on day 0. Mice were subject to daily DMSO or GW7647 injections 
during days 0-6. Red arrows indicate days of blood sample collection. 
b, Ppara−/− mice were treated with DMSO or GW7647 and then injected with 
PHZ. Haemoglobin (HGB), red blood cell numbers (RBC) and haematocrit 
(HCT) were measured on the days indicated. Error bars represent mean ± s.d. 
from six mice. c, BFU-E and CFU-E colony-forming assays of spleen or bone 
marrow cells. Wild-type or Ppara−/− mice were treated with PHZ and DMSO 
(control) or GW7647 (100 μg kg−1) as described earlier. *P < 0.05, Student’s 
t-test. Error bars represent mean ± s.d. from three independent experiments. 
d, Spleen and bone marrow cells were harvested. Representative flow cytometry 
analysis of spleen erythroblasts isolated from GW7647- or DMSO-treated 
wild-type mice at day 3 after PHZ injection. FSC-A, forward scatter area. 
Extended Data Figure 5 | GW7647 increases BFU-E numbers in Nan/+ 
mutant mice. a, Corticosteroid levels in serum were measured in wild-type and 
Nan/+ mutant mice. Each dot represents one mouse. *P < 0.05, Student’s 
t-test. Error bars represent mean ± s.d. from all mice. b, Increase of BFU-E 
at day 18. *P < 0.05, Student’s t-test. c, Total numbers of white blood cells 
(WBC) and platelets from peripheral blood samples were measured at day 0 
and day 18. Each dot represents one mouse. Error bars represent mean ± s.d. 
from all mice. 


©2015 Macmillan Publishers Limited. All rights reserved 


Panel a, genomic distribution of chromatin occupancy sites: 
GR: TSS 1.9%, TTS 0.9%, 3′ UTR 0.7%, exon 4.3%, intron 28%, distal intergenic 63.8% 
PPAR-α: TSS 2.8%, TTS 1.1%, 5′ UTR 0.3%, 3′ UTR 1.2%, exon 3.6%, intron 25.4%, distal intergenic 65.7% 
Panel c, table columns: motif, transcription factor, P value (top entry PU.1, P = 1e-350) 

Extended Data Figure 6 | Bioinformatic analyses of mouse BFU-E cells. 
a, Genome-wide distribution of GR and PPAR-α chromatin occupancy sites in 
BFU-E cells. ChIP-seq analyses of GR and PPAR-α occupancy in mouse BFU-E 
cells isolated from DEX- and GW7647-treated wild-type E14.5 fetal livers. TSS, 
transcription start site; TTS, transcription termination site; UTR, untranslated 
region. Distal intergenic, regions greater than 3 kb from TSS. b, Venn diagram 
showing overlap between GR and PPAR-α chromatin occupancy sites. c, De 
novo motif searching of the overlapped chromatin sites occupied by GR and 
PPAR-α. The table depicts transcription-factor-binding motifs enriched at GR 
and PPAR-α overlapping sites relative to genomic background and associated P 
values. d, Real-time PCR analysis of Pu.1 gene expression in mouse BFU-E cells 
transduced with virus encoding either LacZ shRNA or Pu.1 shRNA. *P < 0.05, 
Student’s t-test. Error bars represent mean ± s.d. from three independent 
experiments. e, Colony-forming assays were conducted to determine BFU-E 
colony numbers from 100 mouse BFU-E cells infected with virus encoding 
either LacZ shRNA or Pu.1 shRNA. Cells were cultured in SFELE medium with 
or without DEX + GW7647. Colony-forming assays were performed at 48 h. 
*P < 0.05, Student’s t-test. Error bars represent mean ± s.d. from three 
independent experiments. 
Extended Data Figure 7 | Kit is a target gene of GR and PPAR-α. a, Real-time 
RT-PCR analysis of Kit gene expression in wild-type and Ppara−/− mouse 
BFU-E cells untreated or treated with DEX with or without the addition of 
GW7647. *P < 0.05, Student’s t-test. Error bars represent mean ± s.d. from 
three independent experiments. b, Human CD34+ cells were treated with or 
without GW7647 as described in Fig. 2. Top, at day 9 of culture, cell-surface KIT 
and CD71 expression were analysed by flow cytometry. Bottom, a 
representative histogram of KIT expression in cells treated or untreated with 
GW7647. c, ChIP-seq occupancy signal map of GR and PPAR-α across the Kit 
locus in BFU-E cells. 
Extended Data Figure 8 | Pathways modulated by PPAR-α activation. Pathway analysis of RNA-seq data of genes that are up- or downregulated by more than 
50%, comparing cultures treated with DEX alone or DEX+GW7647. 
Extended Data Figure 9 | PPAR-α antagonist interferes with the function of 
PPAR-α but not GR. a, Quantitative ChIP analysis of GR and PPAR-α 
occupancy at Kit and Ppara loci in mouse BFU-E cells following the indicated 
treatments. Units are arbitrary; signals using rabbit IgG are represented by 
grey dotted lines across the plots. *P < 0.05, **P < 0.01, Student’s t-test. 
Error bars represent mean ± s.d. from three independent experiments. 
b, Co-immunoprecipitation (IP) measuring interaction between GR and 
PPAR-α in mouse BFU-E cells isolated from E14.5 fetal livers in wild-type or 
Ppara−/− mice. BFU-E cells were untreated, or treated with DEX with or 
without GW7647 with or without GW6471. Whole-cell lysates were incubated 
with anti-GR antibody and immunoprecipitates were probed with specific 
antibodies as indicated. M.W., molecular weight. c, Colony-forming assays to 
determine BFU-E colony numbers from 100 mouse BFU-E cells cultured 
with the indicated treatments. *P < 0.05, Student’s t-test. Error bars represent 
mean ± s.d. from three independent experiments. 
Extended Data Figure 10 | Model of synergism between PPAR-α and GR to 
promote BFU-E self-renewal. BFU-E cells normally undergo limited 
self-renewal to generate CFU-Es, which can differentiate into erythroblasts. GR is 
sequestered in the cytoplasm without glucocorticoids such as DEX. Upon 
glucocorticoid treatment, liganded GR will be translocated into the nucleus and 
bind to chromatin to regulate gene transcription important for BFU-E 
self-renewal. PPAR-α is often recruited to chromatin sites that are in close 
proximity to GR by glucocorticoid treatment alone without any function on 
BFU-E self-renewal. Upon glucocorticoid and PPAR-α agonist co-treatment, 
activated PPAR-α interacts with GR to modulate GR transcriptional activity. 
This leads to enhanced BFU-E self-renewal, and over time generates more 
CFU-Es and erythroblasts. 
A naturally occurring variant of the human prion 
protein completely prevents prion disease 


Emmanuel A. Asante, Michelle Smidak, Andrew Grimshaw, Richard Houghton, Andrew Tomlinson, Asif Jeelani, 
Tatiana Jakubcova, Shyma Hamdan, Angela Richard-Londt, Jacqueline M. Linehan, Sebastian Brandner, Michael Alpers, 
Jerome Whitfield, Simon Mead, Jonathan D. F. Wadsworth & John Collinge 


Mammalian prions, transmissible agents causing lethal neuro- 
degenerative diseases, are composed of assemblies of misfolded 
cellular prion protein (PrP)'. A novel PrP variant, G127V, was 
under positive evolutionary selection during the epidemic of 
kuru—an acquired prion disease epidemic of the Fore population 
in Papua New Guinea—and appeared to provide strong protection 
against disease in the heterozygous state’. Here we have investi- 
gated the protective role of this variant and its interaction with the 
common, worldwide M129V PrP polymorphism. V127 was seen 
exclusively on a M129 PRNP allele. We demonstrate that trans- 
genic mice expressing both variant and wild-type human PrP are 
completely resistant to both kuru and classical Creutzfeldt-Jakob 
disease (CJD) prions (which are closely similar) but can be infected 
with variant CJD prions, a human prion strain resulting from 
exposure to bovine spongiform encephalopathy prions to which 
the Fore were not exposed. Notably, mice expressing only PrP V127 
were completely resistant to all prion strains, demonstrating a 
different molecular mechanism to M129V, which provides its rela- 
tive protection against classical CJD and kuru in the heterozygous 
state. Indeed, this single amino acid substitution (G→V) at a 
residue invariant in vertebrate evolution is as protective as deletion 
of the protein. Further study in transgenic mice expressing different 
ratios of variant and wild-type PrP indicates that PrP V127 is not only 
completely refractory to prion conversion but also acts as a 
potent dose-dependent inhibitor of wild-type prion propagation. 

Prions cause fatal neurodegenerative conditions such as scrapie in 
sheep, bovine spongiform encephalopathy (BSE) in cattle and CJD in 
humans". The fundamental molecular process—seeded propagation of 
assemblies of misfolded host protein—is increasingly recognized as 
being of importance in all the major human neurodegenerative dis- 
eases’. There is a common polymorphism, present worldwide, in the 
substrate protein in human prion disease, human prion protein (PrP), 
where either methionine (M) or valine (V) is present at residue 129. 
MV heterozygosity provides relative protection against acquired, 
sporadic and some inherited prion diseases*® and may have been 
selected during the evolution of modern humans by ancestral prion 
disease epidemics’. This protective effect is thought to relate to inhibi- 
tion of homotypic protein-protein interactions**, although residue 
129 also influences the propagation of particular prion strains via 
conformational selection”’®. Heterozygosity at another polymorph- 
ism, E219K, also provides resistance to CJD in Japan''. 

Kuru was a devastating epidemic prion disease transmitted by endo- 
cannibalism and restricted to a remote area in Papua New Guinea. We 
reported a novel PrP variant (G127V) among unaffected individuals 
that appeared to be a resistance factor selected by the epidemic and 
unique to this region’. Given the proximity to residue 129, we con- 
sidered that it may have a similar action to M129V; that is, blocking 
homotypic interactions and exerting its protective effect only in 
the heterozygous state. No PRNP codon 127VV homozygotes were 


identified in the kuru-exposed population and V127 was always seen 
on an M129 allele’. We therefore generated multiple lines of transgenic 
mice, expressing only human PrP (HuPrP) on a congenic FVB/N PrP 
null (Prnp0/0) background, to investigate whether G127V was indeed 
protective and whether this protection was dependent on heterozyg- 
osity or was an intrinsic property of the variant protein. Furthermore, 
we investigated its interaction with the residue 129 polymorphism. 

We generated two lines of transgenic mice homozygous for HuPrP 
V127: Tg(HuPrP V127M129/V127M129 Prnp0/0)-183 (hereafter 
referred to as V127M129 Tg183) and Tg(HuPrP V127M129/V127M129 
Prnp0/0)-190 (V127M129 Tg190). PrP expression levels 
in homozygotes as compared to pooled normal human brain were 
twofold for V127M129 Tg183 and onefold for V127M129 Tg190. 
G127V heterozygous mice were derived by crossing these lines with 
FVB-congenic versions of Tg35 mice homozygous for HuPrP 
G127M129 (refs 12-15) or Tg152 mice homozygous for HuPrP 
G127V129 (refs 15-19), designated Tg(HuPrP G127M129/G127M129 
Prnp0/0)-35c (G127M129 Tg35c) and Tg(HuPrP 
G127V129/G127V129 Prnp0/0)-152c (G127V129 Tg152c), respectively. 
G127M129 Tg35c and G127V129 Tg152c mice express wild-type 
HuPrP at two and six times, respectively, the level in pooled 
normal human brain (Table 1). Extended Data Fig. 1 shows relative 
PrPC expression levels in all transgenic mice used in this study. 

All transgenic lines were challenged by intracerebral prion inocu- 
lation from well-characterized and previously transmitted human 
prion disease cases including all three PRNP codon 129 genotypes 
and comprising four cases of kuru, twelve cases of classical CJD and 
two cases of variant CJD (vCJD). 

Heterozygous HuPrP G127M129/V127M129 mice (Tg35c × 
Tg183 and Tg35c × Tg190) (Table 1), which have the genotype 
associated with disease resistance in the kuru-exposed human population, 
were completely resistant to all four kuru isolates (which 
included all three PRNP codon 129 genotypes and two molecular 
strain types) (Fig. 1a and Table 2a), while mice expressing wild-type 
HuPrP G127M129/G127M129 (Tg35c) (Fig. 1a and Table 2a) or 
G127V129/G127V129 (Tg152c) (Table 2b) were fully susceptible with 
100% attack rates. This is consistent with the population genetic data 
suggesting kuru resistance of G127V individuals. We have previously 
reported that prion strains seen in kuru brain are indistinguishable 
from those seen in classical CJD patients’’. Similarly, none of the four 
classical CJD isolates from patients of the PRNP genotype 129MM 
transmitted to either line of G127M129/V127M129 mice (Table 2a 
and Fig. 1b), while all four transmitted uniformly to Tg35c mice 
expressing wild-type HuPrP (Fig. 1b and Table 2a). Remarkably, 
however, some (~10%) G127M129/V127M129 mice developed clin- 
ical disease, and a larger number (~70%) showed evidence of sub- 
clinical infection (positive PrP immunohistochemistry and/or western 
blot for PrPSc), on challenge with vCJD prions (Table 2d and 
Extended Data Fig. 2). vCJD is a novel BSE-derived prion strain’?”°”" 
Table 1 | Transgenic mice expressing G127 and V127 human PrP 

Transgenic mice     PRNP codon 127-129   Total HuPrP*  WT† G127 allele  Variant‡ V127 allele  PrP ratio 
                    genotype             expression    expression       expression            G127:V127 
                                         level (×)     level (×)        level (×) 
Parental lines 
Tg35c               G127M129/G127M129    2             2                -                     - 
Tg152c              G127V129/G127V129    6             6                -                     - 
Tg183               V127M129/V127M129    2             -                2                     - 
Tg190               V127M129/V127M129    1             -                1                     - 
F1 crosses 
Tg35c × Tg183       G127M129/V127M129    2             1                1                     1:1 
Tg35c × Tg190       G127M129/V127M129    1.5           1                0.5                   2:1 
Tg152c × Tg183      G127V129/V127M129    4             3                1                     3:1 
Tg152c × Tg190      G127V129/V127M129    3.5           3                0.5                   6:1 
Tg152c+/0 (Hemi)§   G127V129             3             3                -                     - 

WT, wild type. 
* PrP expression level is relative to pooled 10% (w/v) normal human brain homogenate. 
† Wild-type alleles are either G127M129 or G127V129. 
‡ Variant allele is V127M129. 
§ Generated by crossing Tg152c with FVB/PrP-null mice. 


to which the population in the kuru-affected area of Papua New 
Guinea were not exposed. 

The key question with respect to whether the protective effect of 
V127 is conferred via a similar mechanism to the M129V polymorph- 
ism was addressed by challenge of mice homozygous for HuPrP 


a, Kuru (n = 4) 
Tg35c, G127M129/G127M129: transmission of infection 100% 
Tg35c × Tg183, G127M129/V127M129: transmission of infection 0% 
Tg35c × Tg190, G127M129/V127M129: attack rate 0/49, transmission of infection 0% 
Tg183 & Tg190, V127M129/V127M129: attack rate 0/149, transmission of infection 0% 
b, Classical CJD 129MM (n = 4) 
Tg35c, G127M129/G127M129: attack rate 38/38 
Tg35c × Tg183, G127M129/V127M129: attack rate 0/34, transmission of infection 0% 
Tg35c × Tg190, G127M129/V127M129: attack rate 0/29, transmission of infection 0% 
Tg183 & Tg190, V127M129/V127M129: attack rate 0/68, transmission of infection 0% 


Figure 1 | Transmission rates of human prions to transgenic mice 
homozygous or heterozygous for human PrP V127. a, b, Transgenic mice 
were intracerebrally inoculated with brain homogenate from patients with kuru 
(a) or classical CJD (b). Codon 127 and 129 PRNP genotypes of the recipient 
mice are shown. G, glycine; M, methionine; V, valine. G127M129 is a wild-type 
human allele, and the V127M129 allele is seen only in humans from the kuru- 
exposed population of Papua New Guinea. Attack rate reports the total of 
clinically affected and subclinically infected mice as a proportion of the number 
of inoculated mice after prolonged (>600 days) post-inoculation periods. 
Primary prion transmission data are reported in Tables 2 and 3. 


V127M129. While mice homozygous for wild-type HuPrP 
G127M129 or G127V129 are both highly susceptible to CJD prions, 
mice homozygous for HuPrP V127M129 were completely resistant to 
all 18 human prion disease isolates, including vCJD prions, with no 
clinical transmissions or evidence of subclinical infection (Fig. 1a, b, 
Table 3 and Extended Data Fig. 3). 
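The pooled attack rates quoted for the V127 homozygous lines can be rebuilt from the per-isolate rows of Table 3. The sketch below is illustrative only: the helper name and data layout are assumptions, and the counts are the kuru rows of Table 3 entered as (infected, inoculated) pairs.

```python
# (clinically affected + subclinically infected, inoculated) per kuru isolate,
# transcribed from Table 3 for the two V127M129 homozygous lines
KURU_TG183 = [(0, 19), (0, 19), (0, 21), (0, 12)]
KURU_TG190 = [(0, 22), (0, 19), (0, 24), (0, 13)]


def pooled_attack_rate(groups):
    """Sum infected and inoculated mice over several inoculation groups."""
    infected = sum(hit for hit, _ in groups)
    inoculated = sum(total for _, total in groups)
    return infected, inoculated


infected, inoculated = pooled_attack_rate(KURU_TG183 + KURU_TG190)
print(f"Kuru, Tg183 & Tg190 combined: {infected}/{inoculated}")
```

Pooling the two lines in this way gives the 0/149 denominator shown for kuru in Fig. 1a.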

We also challenged mice heterozygous at both residues 127 and 129 
of HuPrP (G127V129/V127M129) produced by crossing Tg183 or 
Tg190 mice with Tg152c mice (Tables 1 and 2b). The higher level of 
expression of HuPrP V129 in Tg152c mice as compared to the other 
lines meant that there was a marked difference in the ratio of express- 
ion of the variant (V127M129) and wild-type (G127V129) HuPrP in 
these crosses (Table 1). While Tg183 and Tg35c had closely similar 
levels of expression of V127M129 and G127M129 HuPrP, respectively, 
such that the Tg35c × Tg183 cross closely modelled the human 
PRNP G127V genotype, the expression ratios of wild-type G127V129 
to variant V127M129 HuPrP were approximately 6:1 and 3:1 for the 
Tg152c × Tg190 and Tg152c × Tg183 crosses, respectively (Table 1 
and Extended Data Figs 1 and 4). Interestingly, in these crosses some 
transmissions of kuru and classical CJD prions were seen (Table 2b 
and Extended Data Figs 4 and 5). In the Tg152c × Tg183 crosses, with 
a threefold excess of wild-type PrP, subclinical infections were seen 
with four out of the twelve isolates used, and one kuru isolate (110336) 
was associated with clinical disease (Table 2b). With the Tg152c × 
Tg190 crosses, however, showing a sixfold excess of wild-type PrP, 
transmissions were seen from all isolates, with the majority resulting 
in clinical disease (Table 2b). 
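The expression ratios in the F1 crosses follow from the parental expression levels in Table 1. The sketch below assumes, as a simplification not stated explicitly in the text, that each F1 animal inherits one copy of each parental transgene array and expresses it at half the homozygous level; the function and dictionary names are hypothetical.

```python
from fractions import Fraction

# Homozygous expression levels relative to pooled normal human brain (Table 1)
PARENT_LEVEL = {
    "Tg35c": Fraction(2),   # wild-type G127M129
    "Tg152c": Fraction(6),  # wild-type G127V129
    "Tg183": Fraction(2),   # variant V127M129
    "Tg190": Fraction(1),   # variant V127M129
}


def f1_cross(wild_type_line, variant_line):
    """Return (wild-type level, variant level, wild-type:variant ratio) for an F1 cross.

    Assumption: each allele is expressed at half its homozygous level.
    """
    wild_type = PARENT_LEVEL[wild_type_line] / 2
    variant = PARENT_LEVEL[variant_line] / 2
    return wild_type, variant, wild_type / variant


for wt_line in ("Tg35c", "Tg152c"):
    for var_line in ("Tg183", "Tg190"):
        wt, var, ratio = f1_cross(wt_line, var_line)
        print(f"{wt_line} x {var_line}: total {float(wt + var)}, G127:V127 = {ratio}:1")
```

Under this assumption the four crosses give totals of 2, 1.5, 4 and 3.5 and ratios of 1:1, 2:1, 3:1 and 6:1, matching Table 1.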

To investigate further the effect of HuPrP V127 on prion propagation, 
we compared hemizygous Tg(HuPrP G127V129 Prnp0/0)-152c 
mice with the Tg152c × Tg183 and Tg152c × Tg190 crosses (Table 
2c). The hemizygous line was challenged with two classical CJD and 
one kuru isolate that had resulted in a 100% attack rate in both crosses. 
Again, 100% attack rates were seen for all three isolates but the incuba- 
tion periods were significantly shorter (P < 0.0001; Table 2c) in the 
absence of HuPrP V127 than in either cross. These data, together with 
comparison of the two crosses themselves (Table 2b), are consistent 
with a dominant-negative effect of HuPrP V127 expression. Although 
dominant-negative inhibition has been reported for other natural 
polymorphic residues of mammalian PrP in transgenic mice, none 
has shown complete prevention of prion conversion when expressed at 
a 1:1 ratio with wild-type PrP. Dominant-negative effects on yeast 
prion propagation have also been reported”’. 
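The size of the incubation-period difference can be illustrated from the summary statistics alone. The following is a rough sketch, not the authors' analysis: it assumes an equal-variance (pooled) unpaired t-test and uses the values for kuru isolate 110336 as read from Table 2 (316 ± 4 days, n = 14, in the Tg152c × Tg190 cross; 222 ± 2 days, n = 15, in Tg152c hemizygotes). The function name is hypothetical.

```python
import math


def pooled_t(mean_a, sem_a, n_a, mean_b, sem_b, n_b):
    """Unpaired equal-variance t statistic from mean, s.e.m. and group size."""
    var_a = sem_a ** 2 * n_a  # variance recovered from s.e.m. = s.d. / sqrt(n)
    var_b = sem_b ** 2 * n_b
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    se_diff = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / se_diff, df


# Tg152c x Tg190 cross versus Tg152c hemizygotes, kuru isolate 110336
t_stat, df = pooled_t(316, 4, 14, 222, 2, 15)
print(f"t = {t_stat:.1f}, df = {df}")
```

A statistic of this magnitude is in line with the reported P < 0.0001, although the exact P value depends on the raw data and on the test variant actually used.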

Our transgenic modelling of the HuPrP G127V polymorphism 
demonstrates that it confers strong protection against prion disease 
in the heterozygous state. However, most importantly, the molecular 
basis of this effect is clearly distinct from that proposed for the well- 
established HuPrP M129V polymorphism, which is protective against 
developing sporadic CJD only in the heterozygous state: inhibition of 
homotypic protein-protein interactions during the process of prion 
Table 2 | Transmission of human prions to transgenic mice heterozygous for human PrP V127 

Incubation periods are given as days ± s.e.m. or as days p.i. 

Aetiology  Source   PrPSc   Attack  Incubation      Attack  Incubation      Attack  Incubation 
           code     type*   rate†   period          rate†   period          rate†   period 

a  Mouse lines, left to right: Tg35c G127M129/G127M129; Tg35c × Tg183 G127M129/V127M129; Tg35c × Tg190 G127M129/V127M129 
Kuru       1516     T3 VV   14/14   509 ± 56.8)     0/11    >518-605        0/13    >517-605 
Kuru       1520     T3 VV   12/12   558 ± 4 (4)     0/14    >525-622        0/14    >460-622 
Kuru       110336   T3 MV   13/13   454 ± 14 (10)   0/14    >522-606        0/11    >504-607 
Kuru       1518     T2 MM   11/11   493 ± 32 (7)    0/12    >545-620        0/13    >544-603 
iCJD (GH)  1035     T1 MM   10/10   221 ± 3 (10)    0/10    >462-603        0/8     >506-602 
sCJD       111058   T1 MM   8/8     231 ± 3 (8)     0/8     >467-605        0/7     >487-605 
iCJD (DM)  1026     T2 MM   10/10   256 ± 9 (8)     0/8     >507-601        0/7     >520-602 
sCJD       17040    T2 MM   10/10   233 ± 3 (10)    0/8     >582-604        0/7     >600-604 

b  Mouse lines, left to right: Tg152c G127V129/G127V129; Tg152c × Tg183 G127V129/V127M129; Tg152c × Tg190 G127V129/V127M129 
Kuru       516      T3 VV   9/9     218 ± 1 (6)     0/13    >447-608        9/15    543, 592 
Kuru       520      T3 VV   9/9     196 ± 7 (7)     0/18    >491-616        13/13   498 ± 17 (5) 
Kuru       110336   T3 MV   15/15   212 ± 3 (11)    15/15   456 ± 3 (15)    15/15   316 ± 4 (14) 
Kuru       518      T2 MM   11/11   211 ± 4 (8)     3/11    >494-620        13/14   468 ± 13 (9) 
sCJD       280      T2 VV   8/8     203 ± 5 (4)     0/4     >600-602        10/10   559 ± 14 (8) 
sCJD       278      T2 VV   6/6     236 ± 8 (6)     0/8     >424-602        8/9     582 
sCJD       284      T2 MV   9/9     338 ± 4 (8)     0/9     >410-602        5/7     >545-608 
sCJD       11478    T2 MV   10/10   248 ± 8 (8)     0/6     >451-608        2/7     >482-603 
sCJD       17394    T3 VV   9/9     213 ± 1 (8)     7/7     >537-677        10/10   427 ± 8 (10) 
sCJD       764      T3 MV   10/10   219 ± 5 (7)     9/10    >368-588        10/10   461 ± 7 (9) 
iCJD (GH)  12651    T3 VV   8/8     200 ± 3 (6)     0/6     >567-607        7/7     538 ± 2 (5) 
iCJD (GH)  020      T3 MV   9/9     211 ± 2 (6)     7/7     >489-602        10/10   399 ± 8 (9) 

c  Mouse lines, left to right: Tg152c × Tg183 G127V129/V127M129; Tg152c × Tg190 G127V129/V127M129; Tg152c+/0 (Hemi) G127V129 
iCJD (GH)  020      T3 MV   7/7     >489-602        10/10   399 ± 8 (9)     14/14   252 ± 3‡ (14) 
sCJD       17394    T3 VV   7/7     >537-677        10/10   427 ± 8 (10)    12/12   245 ± 5‡ (12) 
Kuru       110336   T3 MV   15/15   456 ± 3 (15)    15/15   316 ± 4 (14)    15/15   222 ± 2‡ (15) 

d  Mouse lines, left to right: Tg35c G127M129/G127M129; Tg35c × Tg183 G127M129/V127M129; Tg35c × Tg190 G127M129/V127M129 
vCJD       1342     T4 MM   5/15    >426-603        9/15    467, 556        17/18   419 ± 17 (3) 
vCJD       17042    T4 MM   4/14    559, 561        4/13    >496-607        10/12   596 

iCJD, iatrogenic CJD; sCJD, sporadic CJD; GH, growth hormone; DM, dura mater; p.i., post inoculation. 
* According to classification of ref. 28. 
† Attack rate is defined as the total number of both clinically affected and subclinically infected mice as a proportion of the total number of inoculated mice. Subclinical prion infection was assessed by 
immunoblotting and/or immunohistochemical examination of brain. Incubation periods are reported for clinically affected mice in days; where n ≥ 3 the mean ± s.e.m. is reported with the number of mice 
contributing to the mean shown in parentheses, otherwise individual incubation times are given. In groups where no clinical transmission of prion disease was observed, the attack rate represents subclinical 
infection only and the interval between inoculation and death (from either senescence, culling due to inter-current illness or termination of the experiment) is reported as >x-y days. 
‡ Mean incubation periods for all three isolates are significantly lower in Tg152c hemizygotes than in Tg190 × Tg152c heterozygotes (P < 0.0001; two-tailed unpaired t-test). 


Table 3 | Transmission of human prions to transgenic mice homozygous for human PrP V127 

Incubation periods are given as days p.i. 

Aetiology  Source   PrPSc   Attack  Incubation      Attack  Incubation 
           code     type*   rate†   period          rate†   period 

Mouse lines, left to right: Tg183 V127M129/V127M129; Tg190 V127M129/V127M129 
Kuru       516      T3 VV   0/19    >488-604        0/22    >391-609 
Kuru       520      T3 VV   0/19    >391-609        0/19    >370-609 
Kuru       110336   T3 MV   0/21    >405-607        0/24    >463-617 
Kuru       518      T2 MM   0/12    >439-600        0/13    >432-603 
vCJD       342      T4 MM   0/8     >553-605        0/10    >506-604 
vCJD       17042    T4 MM   0/11    >522-609        0/14    >446-607 
iCJD (GH)  035      T1 MM   0/10    >434-622        0/10    >450-602 
sCJD       111058   T1 MM   0/8     >411-609        0/9     >454-612 
iCJD (DM)  026      T2 MM   0/9     >381-602        0/8     >516-602 
sCJD       17040    T2 MM   0/8     >524-601        0/6     >564-600 
sCJD       280      T2 VV   0/9     >532-602        0/9     >425-601 
sCJD       278      T2 VV   0/7     >530-602        0/7     >417-601 
sCJD       284      T2 MV   0/6     >549-603        0/6     >466-609 
sCJD       11478    T2 MV   0/7     >517-602        0/7     >418-617 
sCJD       17394    T3 VV   0/8     >500-600        0/8     >600-603 
sCJD       764      T3 MV   0/10    >447-602        0/8     >484-607 
iCJD (GH)  12651    T3 VV   0/7     >563-606        0/7     >487-599 
iCJD (GH)  020      T3 MV   0/6     >546-603        0/6     >572-600 

p.i., days post inoculation; vCJD, variant CJD; sCJD, sporadic CJD; iCJD, iatrogenic CJD; GH, growth hormone; DM, dura mater. 
* According to classification of ref. 28. 
† Attack rate is defined as the total of clinically affected and subclinically infected mice as a proportion of the number of inoculated mice. Subclinical prion infection was assessed by immunoblotting and/or 
immunohistochemical examination of brain. As no clinical transmission of prion disease was observed, the interval between inoculation and death (from either senescence, culling due to inter-current illness or 
termination of the experiment) is reported as >x-y days. 
propagation. Here we demonstrate that HuPrP V127 is intrinsically 
resistant to prion conversion and indeed capable of inhibiting pro- 
pagation of wild-type prions in a dose-dependent manner. Mice 
expressing only HuPrP V127 appear to be as resistant to prion disease 
as PrP null mice”®, and understanding the structural basis of this effect 
may therefore provide critical insight into the molecular mechanism of 
mammalian prion propagation. 

At its height, when first recognized by Western medicine in 1957, 
kuru was a devastating epidemic largely affecting women and children 
and killing up to 2% of the population annually in some villages. Some 
villages became largely devoid of young women of childbearing age. 
While collapse of the Fore population was prevented by cessation of 
endocannibalism in the late 1950s, which interrupted the route of 
transmission and led to a gradual decline in incidence”’, our data 
suggest that if transmission had continued the epicentre of the affected 
region might have been repopulated with kuru-resistant individuals as 
a population genetic response to the epidemic. Understanding the 
structural basis of why HuPrP V127 is unable to propagate prions of 
multiple strain types may provide key insights into prion propagation 
and the development of rational therapeutics. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


No statistical methods were used to predetermine sample size. 

Ethics statement. Storage and biochemical analyses of post-mortem human brain 
samples and transmission studies to mice were performed with written informed 
consent from patients with capacity to give consent. Where patients were unable to 
give informed consent, assent was obtained from their relatives in accordance with 
UK legislation and Codes of Practice. Samples were stored and used in accordance 
with the Human Tissue Authority Codes of Practice and in line with the require- 
ments of the Human Tissue Authority licence held by UCL Institute of Neurology. 
This study was performed with approval from the Medical Research Advisory 
Committee of the Government of Papua New Guinea, the National Hospital for 
Neurology and Neurosurgery and the UCL Institute of Neurology Joint Research 
Ethics Committee (now National Research Ethics Service Committee, London - 
Queen Square); REC references: 03/N036, 03/N038 and 03/N133. Work with mice 
was performed under approval and licence granted by the UK Home Office 
(Animals (Scientific Procedures) Act 1986); Project Licence number 70/6454 
which conformed to University College London institutional and ARRIVE guide- 
lines (http://www.nc3rs.org.uk/ARRIVE/). 

Generation of transgenic mice. The 759-bp human PrP ORF was amplified by 
PCR with Pfu polymerase from genomic DNA prepared from the brain of a patient 
with the V127 polymorphism on the M129 allele, using forward primer 
5’-GTCGACCAGTCATTATGGCGAACCTT-3’ and reverse primer 
5’-CTCGAGAAGACCTTCCTCATCCCACT-3’. Restriction sites Sal I (GTCGAC) 
and Xho I (CTCGAG) (underlined) were introduced in the forward and reverse 
primers respectively for cloning. The blunt-ended PCR fragment generated by Pfu 
polymerase was subcloned into Sma I-digested pSP72 vector and sequenced to 
ensure that no spurious alterations had been introduced by the PCR and to 
confirm the presence of V127 and M129 polymorphisms matching the patient 
DNA template. The amplified human PrP ORF with the confirmed polymorphisms 
was then isolated by Sal I and Xho I digestion. Subsequent subcloning into the 
Sal I site of the cosmid vector SHaCosTt, packaging and preparation of high quality 
DNA of the Not I transgene insert was as previously reported'*. Microinjection of 
the purified Not I DNA fragment was carried out according to standard protocol 
into single cell eggs of Prnp null mice which had been backcrossed onto an FVB/N 
genetic background. Genotyping was performed by PCR and PrP expression levels 
estimated by western blot analysis as previously reported'*”’. Two homozygous 
lines were established for variant HuPrP V127M129 described as Tg(HuPrP
V127M129/V127M129 Prnp0/0)-183 (V127M129 Tg183) and Tg(HuPrP
V127M129/V127M129 Prnp0/0)-190 (V127M129 Tg190), with PrPC expression
levels of 2 and 1 times, respectively, that of pooled 10% (w/v) normal human brain
homogenate (Table 1). FVB-congenic versions of Tg35 mice homozygous for
HuPrP G127M129 (refs 12-15) and Tg152 mice homozygous for HuPrP
G127V129 (refs 15-19) were used as wild-type human PrP-expressing controls,
designated Tg(HuPrP G127M129/G127M129 Prnp0/0)-35c (G127M129 Tg35c)
and Tg(HuPrP G127V129/G127V129 Prnp0/0)-152c (G127V129 Tg152c),
respectively. The PRNP codon 127-129 genotypes and relative expression levels
of wild-type and variant PrP in the parental transgenic lines and in the F1 crosses
are shown in Table 1. 
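
The primer design described above can be checked mechanically. The following is a minimal sketch, not part of the original protocol: it takes the two primer sequences and the two recognition sequences exactly as printed in the text, and confirms that each primer carries its cloning site at the 5' end. The helper names (`leading_site`, `codons_in_orf`) are hypothetical and introduced only for illustration.

```python
# Primer sequences as printed in the Methods text (written 5'->3').
FORWARD_PRIMER = "GTCGACCAGTCATTATGGCGAACCTT"
REVERSE_PRIMER = "CTCGAGAAGACCTTCCTCATCCCACT"

# Recognition sequences quoted in the text for the two cloning sites.
RESTRICTION_SITES = {"Sal I": "GTCGAC", "Xho I": "CTCGAG"}


def leading_site(primer, sites=RESTRICTION_SITES):
    """Return the name of the restriction site at the 5' end of a primer, or None."""
    primer = primer.upper()
    for name, recognition_sequence in sites.items():
        if primer.startswith(recognition_sequence):
            return name
    return None


def codons_in_orf(length_bp):
    """Number of codons (stop codon included) in an ORF of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return length_bp // 3


if __name__ == "__main__":
    print(leading_site(FORWARD_PRIMER))  # site carried by the forward primer
    print(leading_site(REVERSE_PRIMER))  # site carried by the reverse primer
    print(codons_in_orf(759))            # codons in the 759-bp ORF
```

A check of this kind only confirms that the printed sequences are internally consistent with the stated cloning strategy; it says nothing about primer specificity or annealing behaviour.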

Transmission studies. Strict biosafety protocols were followed. Inocula were 
prepared, using disposable equipment for each inoculum, in a microbiological 
containment level 3 laboratory and inoculations performed within a class 1
microbiological safety cabinet as described previously'*”. Ten mice per group
of control G127M129 Tg35c and G127V129 Tg152c lines and 15 per group of 
newly generated V127M129 Tg183 and V127M129 Tg190 lines and their respect- 
ive crosses with Tg35c and Tg152c were inoculated (see below) with human brain 
homogenates from neuropathologically confirmed patients comprising four kuru 
cases, eight sporadic CJD cases, four iatrogenic CJD cases and two cases of vCJD. 
The genotype of each mouse was confirmed by PCR of ear punch DNA before 
inclusion and all mice were uniquely identified by sub-cutaneous transponders. 
Disposable cages were used and all cage lids and water bottles were also uniquely 
identified by transponder and remained with each cage of mice throughout the 
incubation period. Care of the mice was according to institutional guidelines. Mice 
(female, aged 6-8 weeks) were randomly assigned to experimental groups and 
anaesthetized with a mixture of halothane and O2, and intracerebrally inoculated
into the right parietal lobe with 30 µl of a 1% (w/v) brain homogenate prepared in
Dulbecco’s phosphate buffered saline lacking Ca2+ or Mg2+ ions (D-PBS). All
mice were thereafter examined daily for early indicators of clinical prion disease 
including piloerection, sustained erect ears, intermittent generalized tremor, 
unsustained hunched posture, rigid tail, mild loss of coordination, and clasping 
hind legs when lifted by the tail. Definite diagnosis of clinical prion disease (trig- 
gering experimental end point) was reached if mice exhibited any two early indi- 
cator signs in addition to one confirmatory sign, or any two confirmatory signs. 
The confirmatory signs included ataxia, impairment of righting reflex, dragging of 
hind limbs, sustained hunched posture, or prominent abnormal breathing. Mice 
were killed (by CO2 asphyxiation) if they exhibited any signs of distress or once a
diagnosis of prion disease was established. At post-mortem, brains from inocu- 
lated mice were removed, divided sagittally with half frozen and half fixed in 10% 
buffered formol saline. Subsequent immunohistochemical or biochemical investi- 
gations were performed blind to sample provenance. 
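
The end-point rule stated above (two early indicator signs plus one confirmatory sign, or any two confirmatory signs) can be written as a small decision function. This is an illustrative sketch only: the sign names are taken from the text, but the function name `is_definite_diagnosis` and the idea of encoding observations as a set of strings are assumptions made for the example, not a description of how the authors recorded their observations.

```python
# Sign lists copied from the Methods text.
EARLY_SIGNS = frozenset({
    "piloerection",
    "sustained erect ears",
    "intermittent generalized tremor",
    "unsustained hunched posture",
    "rigid tail",
    "mild loss of coordination",
    "clasping hind legs when lifted by the tail",
})

CONFIRMATORY_SIGNS = frozenset({
    "ataxia",
    "impairment of righting reflex",
    "dragging of hind limbs",
    "sustained hunched posture",
    "prominent abnormal breathing",
})


def is_definite_diagnosis(observed_signs):
    """Apply the stated rule: (>=2 early and >=1 confirmatory) or >=2 confirmatory."""
    observed = set(observed_signs)
    early = len(observed & EARLY_SIGNS)
    confirmatory = len(observed & CONFIRMATORY_SIGNS)
    return confirmatory >= 2 or (early >= 2 and confirmatory >= 1)


if __name__ == "__main__":
    print(is_definite_diagnosis({"piloerection", "rigid tail", "ataxia"}))
    print(is_definite_diagnosis({"piloerection", "rigid tail"}))
```

Note that the separate welfare criterion in the text (mice showing any signs of distress were also killed) is deliberately not modelled here.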

Neuropathology and immunohistochemistry. Fixed brain was immersed in 98% 
formic acid for 1 h and paraffin wax embedded. Serial sections of 4 µm thickness
were pre-treated by boiling for 10 min in a low ionic strength buffer (2.1 mM Tris, 
1.3 mM EDTA, 1.1 mM sodium citrate, pH 7.8) before exposure to 98% formic 
acid for 5 min. Abnormal PrP accumulation was examined using anti-PrP mono- 
clonal antibody ICSM 35 (D-Gen Ltd, London) on a Ventana automated immuno- 
histochemical staining machine (Ventana Medical Systems Inc., Tucson, Arizona) 
using proprietary secondary detection reagents (Ventana Medical Systems Inc.) 
before development with 3,3′-diaminobenzidine tetrahydrochloride as the chromogen”.
Harris haematoxylin and eosin staining was done by conventional methods. 
Appropriate controls were used throughout. 

Immunoblotting. Preparation of brain homogenates (10% (w/v) in D-PBS), proteinase K
digestion (75 µg ml−1 for 1 h at 37 °C) and subsequent immunoblotting
was performed as described previously***°. Blots were probed with anti-PrP 
monoclonal antibody ICSM 35 (D-Gen Ltd, London) in conjunction with an 
anti-mouse IgG-alkaline phosphatase conjugate and development in chemilumin- 
escent substrate (CDP-Star; Tropix Inc.). Primary screening of brain homogenates 
was performed blind to sample identity. 


29. Asante, E. A. et al. Absence of spontaneous disease and comparative prion
susceptibility of transgenic mice expressing mutant human prion proteins. J. Gen. 
Virol. 90, 546-558 (2009). 

30. Wadsworth, J. D. et al. Molecular diagnosis of human prion disease. Methods Mol.
Biol. 459, 197-227 (2008). 
Extended Data Figure 1 | Immunoblots showing relative PrP expression
levels in transgenic mice. The provenance of each brain sample is designated 
above each lane and molecular markers are indicated on the left. The PRNP 
codon 127 (G, glycine, V, valine) or codon 129 (M, methionine, V, valine) 
genotypes of the transgenic mice are designated below. Transgenic mouse 
brains were analysed by enhanced chemiluminescence without proteinase-K 
digestion and equal amounts of total protein loaded in each well and probed 
with anti-PrP monoclonal antibody ICSM 35. Wild-type human PrP
expression levels of G127M129/G127M129 Tg35c and G127V129/G127V129 
Tg152c mice are two- and sixfold higher, respectively, than seen in 10% (w/v) 
pooled human brain homogenate. Homozygous V127M129 Tg183 and Tg190 
mice have twofold higher or equivalent PrP expression levels, respectively, 
compared to 10% (w/v) pooled human brain homogenate. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Panel labels: vCJD T4 MM 1342; vCJD T4 MM 17042; Tg35c/Tg183; Tg35c/Tg190.

Extended Data Figure 2 | Immunohistochemical analysis of transgenic
mouse brain following challenge with vCJD prions. All mice were
intracerebrally challenged with the same vCJD prion isolates (1342 and 17042).
Abnormal PrP deposition in fixed post-mortem brain from affected mice was
detected using anti-PrP monoclonal antibody ICSM 35. a, b, Homozygous
Tg35c mice (expressing G127M129/G127M129 wild-type PrP only) show
intense and widespread PrP plaque deposits. Magnified areas show
hippocampus (a, panel i), frontal cortex (b, panel i) and thalamus (a, panel ii,
and b, panel ii). c, d, In contrast, heterozygous G127M129/V127M129 Tg35c/Tg183
mice expressing equivalent levels of G127M129 and V127M129 PrP
show only weak PrP deposition in the corpus callosum (c, panel i, and d, panel i)
with no abnormal PrP deposition detected in other brain areas, for example, in
the thalamus (c, panel ii, and d, panel ii). e, f, Heterozygous G127M129/V127M129
Tg35c/Tg190 mice which express a lower level of V127M129 PrP
relative to wild-type G127M129 PrP show greater levels of PrP deposition than
seen in Tg35c/Tg183 mice following challenge with the same vCJD prion
isolates. e (panels i and ii) and f (panel i), corpus callosum; f (panel ii), pons.
Scale bar: upper panels, 2 mm; magnified panels, 100 µm.
Extended Data Figure 3 | Immunohistochemical analysis of homozygous
PRNP V127M129/V127M129 transgenic mouse brain following challenge
with human prions. Mice were intracerebrally challenged with kuru, classical
and variant CJD prions. Following prolonged (>600 days) post-inoculation
periods, abnormal PrP deposition in fixed post-mortem brain was examined
using anti-PrP monoclonal antibody ICSM 35. a-d, V127M129/V127M129
Tg183 mice with twofold overexpression of V127M129 PrP. e-h, V127M129/V127M129
Tg190 mice expressing endogenous levels of V127M129 PrP. Red
square boxes in the main panels (a-h) define the area of magnified images of
hippocampus (panel i) and thalamus (panel ii) shown in the lower panels. Scale
bars, 2 mm; magnified panels i and ii, 100 µm. The lack of detection of
abnormal PrP deposition in brain indicates that the mice are not subclinically
infected with prions.
Infection with kuru and classical CJD prions

Homozygotes: Tg35c (G127M129/G127M129); Tg152c (G127V129/G127V129); Tg183 or Tg190 (V127M129/V127M129).
Heterozygotes: G127M129/G127V129; G127M129/V127M129; G127V129/V127M129.
PrP expression (G127:V127): 100% G127; (2:1); (1:1); (6:1); (3:1).
Key: degree of susceptibility to infection, from full to entirely resistant (no bar).


Extended Data Figure 4 | Schematic diagram summarizing transmissions 
of kuru and classical CJD prions to transgenic mice expressing human PrP. 
G127M129/G127M129 Tg35c mice or G127V129/G127V129 Tg152c mice are 
homozygous for wild-type human PrP alleles and are fully susceptible to kuru 
and classical CJD prions. V127M129/V127M129 Tg183 or V127M129/ 
V127M129 Tg190 transgenic mice are homozygous for the variant V127M129 
allele found only in humans from the kuru-exposed population of Papua New 
Guinea and are entirely resistant to infection with kuru and classical CJD 
prions. The levels of PrP expression in the brain of these homozygous 
transgenic mice relative to a pooled human brain homogenate are 1× (Tg190
mice), 2× (Tg35c and Tg183 mice) and 6× (Tg152c mice). Generation of F1
mice through inter-breeding the various homozygous lines produces different 
combinations of the various human PrP alleles leading to differences in the 
relative expression levels of the various prion proteins in brain. The PrP 
expression ratios from the two PrP alleles in the crosses are shown in 
parentheses above the mouse sketches. Full, intermediate or low susceptibility 
of the mice to infection with kuru and classical CJD prions is indicated by three, 
two or one diagonal red bar, respectively, drawn across the mice. Mice with no 
red bar are entirely resistant to infection with kuru and classical CJD prions. 
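
The allele expression ratios quoted for the F1 crosses follow from the homozygous expression levels given in the caption (1× for Tg190, 2× for Tg35c and Tg183, 6× for Tg152c). The sketch below reproduces that arithmetic under one stated assumption that is not spelled out in the text: each F1 animal inherits one transgene array from each parent, and each array contributes half of the corresponding homozygous level, so the factor of one half cancels in the ratio. The function name `f1_expression_ratio` is hypothetical.

```python
from fractions import Fraction

# Homozygous PrP expression relative to pooled human brain homogenate (from the caption).
HOMOZYGOUS_LEVEL = {"Tg35c": 2, "Tg152c": 6, "Tg183": 2, "Tg190": 1}


def f1_expression_ratio(wild_type_line, variant_line, levels=HOMOZYGOUS_LEVEL):
    """Return the (G127 : V127) expression ratio in an F1 cross, in lowest terms.

    Assumes each parental transgene array contributes half of its homozygous
    level in the F1 animal, so only the ratio of parental levels matters.
    """
    ratio = Fraction(levels[wild_type_line], levels[variant_line])
    return ratio.numerator, ratio.denominator


if __name__ == "__main__":
    for wt in ("Tg35c", "Tg152c"):
        for variant in ("Tg190", "Tg183"):
            g127, v127 = f1_expression_ratio(wt, variant)
            print(f"{wt}/{variant}: {g127}:{v127}")
```

Under this assumption the four wild-type by variant crosses give 2:1, 1:1, 6:1 and 3:1, which are the same four ratios shown in parentheses in the figure.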




Panel labels: Tg152c; Tg152c/Tg183; Tg152c/Tg190; Kuru T3 MV 110336; iCJD T3 MV 1020.

Extended Data Figure 5 | Immunohistochemical analysis of homozygous
PRNP G127V129/G127V129 and heterozygous G127V129/V127M129
transgenic mouse brain following challenge with human prions. Mice were
intracerebrally challenged with kuru and classical CJD prions, and abnormal
PrP deposition in fixed post-mortem brain from affected mice was examined
using anti-PrP monoclonal antibody ICSM 35. Red square boxes labelled i and
ii in panels a-f mark brain areas that are magnified and displayed below; i,
cortex; ii, thalamus. a, b, Wild-type G127V129/G127V129 Tg152c mice.
c, d, Heterozygous G127V129/V127M129 Tg152c/Tg183 mice.
e, f, Heterozygous G127V129/V127M129 Tg152c/Tg190 mice. Scale bars, 2 mm;
magnified panels i and ii, 100 µm. The detection of abnormal PrP deposition in
brain indicates that the mice are infected with prions.
The Drosophila TNF receptor Grindelwald couples
loss of cell polarity and neoplastic growth 


Ditte S. Andersen, Julien Colombani, Valentina Palmerini, Krittalak Chakrabandhu, Emilie Boone,
Michael Röthlisberger, Janine Toggweiler, Konrad Basler, Marina Mapelli, Anne-Odile Hueber & Pierre Léopold


Disruption of epithelial polarity is a key event in the acquisition of 
neoplastic growth. JNK signalling is known to play an important 
part in driving the malignant progression of many epithelial 
tumours, although the link between loss of polarity and JNK sig- 
nalling remains elusive. In a Drosophila genome-wide genetic 
screen designed to identify molecules implicated in neoplastic 
growth’, we identified grindelwald (grnd), a gene encoding a trans- 
membrane protein with homology to members of the tumour nec- 
rosis factor receptor (TNFR) superfamily. Here we show that Grnd 
mediates the pro-apoptotic functions of Eiger (Egr), the unique 
Drosophila TNF, and that overexpression of an active form of 
Grnd lacking the extracellular domain is sufficient to activate 
JNK signalling in vivo. Grnd also promotes the invasiveness of 
RasV12/scrib−/− tumours through Egr-dependent Matrix metalloprotease-1
(Mmp1) expression. Grnd localizes to the subapical
membrane domain with the cell polarity determinant Crumbs 
(Crb) and couples Crb-induced loss of polarity with JNK activation 
and neoplastic growth through physical interaction with Veli (also 
known as Lin-7). Therefore, Grnd represents the first example of a 
TNFR that integrates signals from both Egr and apical polarity 
determinants to induce JNK-dependent cell death or tumour 
growth. 

We recently carried out a genome-wide screen to identify molecules 
that are required for neoplastic growth’. The condition used for this 
screen was the disc-specific knockdown of avalanche (rotund 
(rn)>avl-RNAi; avalanche also known as syntaxin 7), a gene encoding 
a syntaxin that functions in the early step of endocytosis*. rn>avl- 
RNAi results in ectopic Wingless (Wg) expression, neoplastic disc 
overgrowth’ (Fig. 1a, b), and a 2-day delay in larva-to-pupa transition’.
We screened a collection of 10,100 transgenic RNA interference 
(RNAi) lines for their ability to rescue the pupariation delay and iden- 
tified 121 candidate genes’. Interestingly, only eight candidate genes 
also rescued ectopic Wg expression and neoplastic overgrowth 
(Extended Data Fig. 1a). These included five lines targeting core com- 
ponents of the JNK pathway (Bendless, Tab2, Tak1, Hemipterous and 
Basket; Extended Data Fig. 1b). Using a puckered enhancer trap (puc- 
lacZ) as a readout for JNK activity, we confirmed that JNK signalling is 
highly upregulated in rn>avl-RNAi discs (Fig. 1a, b, right). One of the 
remaining lines targets CG10176, a gene encoding a transmembrane 
protein. Reducing expression of CG10176 by using two different
RNAi lines was as efficient as tak1 silencing in restoring normal Wg
pattern and suppressing JNK signalling and neoplastic growth in the
rn>avl-RNAi background (Fig. 1c, d and Extended Data Fig. 1c-e).
Sequence analysis of CG10176 identified a cysteine-rich domain
(CRD) in the extracellular part with homology to vertebrate TNFRs
(Fig. 1e and Extended Data Fig. 2) harbouring a glycosphingolipid-binding
motif (GBM) characteristic of many TNFRs including Fas’
(Fig. 1e and Extended Data Fig. 2). We named CG10176 grindelwald
(grnd), after a village at the foot of Eiger, a Swiss mountain that lent its 
name to the unique Drosophila TNF, Egr. Immunostaining and sub- 
cellular fractionation of disc extracts confirmed that Grnd localizes to 
the membrane (Extended Data Fig. 1f-h). Moreover, co-immunopre- 
cipitation experiments showed that both Grnd full-length and Grnd- 
intra, a form lacking its extracellular domain, directly associate with 
Traf2, the most upstream component of the JNK pathway (Fig. 1f-h). 
This interaction is disrupted by a single amino acid substitution within 
a conserved Traf6-binding motif* (human TRAF6 is the closest homo- 
logue to Traf2; Fig. 1f-h and Extended Data Fig. 3). Overexpression of 
Grnd-intra, but not full-length Grnd, is sufficient to induce JNK signalling,
ectopic Wg expression and apoptosis (Fig. 1i-j and Extended
Data Fig. 4a-c), and Grnd-intra-induced apoptosis is efficiently suppressed
in a hep (JNKK) mutant background (Extended Data Fig. 4d,
e), confirming that Grnd acts upstream of the JNK signalling cascade. 

The Drosophila TNF Egr activates JNK signalling and triggers cell 
death or proliferation, depending on the cellular context*. We there- 
fore tested whether Grnd is required for the small-eye phenotype 
generated by Egr-induced apoptosis in the retinal epithelium®’ 
(GMR>egr; Fig. 2a, b). As previously shown, inhibition of JNK sig- 
nalling by reducing tak1 (ref. 6) or traf2 (ref. 8) expression, or by 
overexpressing puckered’, blocks Egr-induced apoptosis and rescues 
the small-eye phenotype (Extended Data Fig. 5a-d). In contrast to a
previous report”, RNAi silencing of wengen (wgn), a gene encoding a 
presumptive receptor for Egr, does not rescue the small-eye phenotype 
(Extended Data Fig. 5e). Furthermore, the small-eye phenotype is not 
modified in a wgn-null mutant background (Fig. 2c and Extended Data 
Fig. 5h, m, n), confirming that Wgn is not required for Egr-induced 
apoptosis in the eye. By contrast, reducing grnd levels partially rescues 
the Egr-induced small-eye phenotype, producing a ‘hanging-eye’ 
phenotype (Fig. 2d) that is not further rescued in a wgn-knockout 
(wgnKO) mutant background (Extended Data Fig. 5e-i). A similar
phenotype was previously reported as a result of non-autonomous cell 
death induced by a diffusible form of Egr'® (Extended Data Fig. 5k, l).
This suggests that Grnd prevents Egr from diffusing outside of its 
expression domain. Co-immunoprecipitation experiments show that 
both full-length Grnd and Grnd-extra, a truncated form of Grnd lack- 
ing the cytoplasmic domain, associate with Egr through its TNF- 
homology domain (Fig. 2j, k). Although Grnd-extra can bind Egr, it 
cannot activate JNK signalling. Therefore, we reasoned that Grnd- 
extra expression might prevent both cell-autonomous and non-auto- 
nomous apoptosis by trapping Egr and preventing its diffusion and 
binding to endogenous Grnd. Indeed, GMR-Gal4-mediated express- 
ion of grnd-extra fully rescues the Egr small-eye phenotype (Fig. 2e and 
Extended Data Fig. 5j). To confirm that the removal of Grnd induces 
Egr-mediated non-autonomous cell death, we generated wing disc 
Figure 1 | Grnd is a novel member of the TNFR superfamily. a-d, Wing
discs stained for Wg (left) or LacZ (right). e, Sequence alignment of Grnd and
human TNFRs. Blue, identity; yellow, strong similarity to Grnd; grey, weak
similarity to Grnd. f, g, Co-immunoprecipitation experiments from S2R+ cells
co-expressing haemagglutinin (HA)-Traf2 or Ihog-HA (control), and either
Grnd-Flag, Grnd-intra-Flag or Grnd(E202A)-Flag (f), or HA-Egr and either
Grnd-Flag, Grnd(E202A)-Flag or Ihog-Flag (control) (g). Input loading is
quantified using β-tubulin. h, Pull-down experiments of Traf2 with various
truncated forms of glutathione S-transferase (GST)-Grnd. Coomassie blue
staining is used to visualize the bound species. WT, wild type. i-j, Wing discs
stained for Wg (left), LacZ (middle) and Grnd (right).


clones expressing egr alone, egr + tak1 RNAi, or egr + grnd RNAi 
(Fig. 2f-i). As expected, reducing tak1 levels in egr-expressing clones 
prevents their elimination by apoptosis (Fig. 2f, g). Similarly, reducing 
grnd levels prevents autonomous cell death, but also induces non- 
autonomous apoptosis (Fig. 2h, i). This suggests that Egr, like its 
mammalian counterpart TNF-α, can be processed into a diffusible
form in vivo whose interaction with Grnd limits the potential to act 
Figure 2 | Grnd is required for Egr-induced apoptosis. a-e, Light
micrographs of Drosophila adult eyes of the indicated genotypes (all animals
carry the GMR-Gal4 driver). Original magnification, ×30. f-i, Wing discs
carrying green fluorescent protein (GFP)-labelled clones (green) expressing egr
(f), egr + tak1 RNAi (g) or egr + grnd RNAi (h, i) and stained for Caspase 3
(Cas3; red). A close-up shows non-autonomous cell death around the egr +
grnd RNAi clones. j, k, Co-immunoprecipitation (IP) experiments from S2R+
cells expressing HA-Egr or HA-EgrΔTNF (Egr lacking its TNF-homology
domain; amino acids 278-415) and either Grnd-Flag, Grnd-extra-Flag or
Ihog-Flag (control). Input loading is quantified using β-tubulin.


at a distance. Flies carrying homozygous (grndMinos/Minos) or transheterozygous
(grndMinos/Df) combinations of a transposon inserted in the
grnd locus express no detectable levels of Grnd protein (Extended Data
Fig. 6a, b) and are equally resistant to Egr-induced cell death (Extended
Data Fig. 6e-j). In addition, grndMinos/Minos mutant flies are viable and
display no obvious phenotype (Extended Data Fig. 6c, d), suggesting 
that Grnd, like Egr, participates in a stress response to limit organismal 
damage. Collectively, our data demonstrate that Grnd is a new 
Drosophila TNF receptor that mediates most, if not all, Egr-induced 
apoptosis. 

TNFs probably represent a danger signal produced in response to 
tissue damage to rid the organism of premalignant tissue or to facilitate 
wound healing. Disc clones mutant for the polarity gene scribbled 
(scrib) induce an Egr-dependent response resulting in the elimination 
of scrib mutant cells by JNK-mediated apoptosis''’*. To test the 
requirement for Grnd in this process, we compared scrib-RNAi and 
scrib-RNAi + grnd-RNAi clones obtained 72 h after heat shock induc- 
tion. As expected, scrib-RNAi cells undergo apoptosis and detach from 
Panel labels: scrib RNAi + rh3 RNAi; scrib RNAi + grnd RNAi.

Figure 3 | Grnd is required for the Egr-induced invasiveness of RasV12/scrib−/−
tumours. a-f, Wing discs dissected 5 days after egg deposition (AED)
stained for Cas3 (red) and carrying GFP-labelled clones (green) of the indicated
genotypes induced 2 days AED. a, b, e, Wild-type clones (a) and clones
expressing the control Rh3 RNAi (b) to exclude a Gal4 titration effect (e). g-i,
Eye-brain complexes dissected 7 days AED carrying RasV12/scrib−/− (g) or
RasV12/scrib−/− + grnd RNAi (h) clones (labelled by GFP), or RasV12/scrib−/−
GFP-labelled clones in an egr mutant background (i) stained for Mmp1
(g-i, middle, red) and 4',6-diamidino-2-phenylindole (DAPI; blue).
RasV12/scrib−/− clones invade the ventral nerve cord (g, white arrow). c.c.,
cephalic complex.
the epithelium (Fig. 3a-d). By contrast, scrib-RNAi clones with reduced
grnd expression survive (Fig. 3e, f), indicating that Grnd is required
for Egr-dependent elimination of scrib-RNAi cells. Similar results
were obtained by generating scrib−/− mutant clones in the eye disc
(Extended Data Fig. 7a-d).

In both mammals and flies, TNFs are double-edged swords that also
have the capacity to promote tumorigenesis in specific cellular contexts’.
Indeed, scrib−/− eye disc cells expressing an activated form of
Ras (RasV12) exhibit a dramatic tumour-like overgrowth and metastatic
behaviour, a process that critically relies on Egr'*. RasV12/scrib−/−
metastatic cells show a strong accumulation of Grnd and
Mmp1, and invade the ventral nerve cord'’’*’* (Fig. 3g and
Extended Data Figs 7e, 8a). Primary tumour cells reach peripheral
tissues such as the fat body and the gut, where they form micrometastases
expressing high levels of Grnd (Extended Data Fig. 7f, g).
Reducing grnd levels in RasV12/scrib−/− clones is sufficient to restore
normal levels of Mmp1 and abolish invasiveness in a way similar to
that observed in an egr−/− background" (Fig. 3h, i and Extended Data
Fig. 8a-c). Therefore, Grnd is required for the Egr-induced metastatic
behaviour of RasV12/scrib−/− tumorous cells. Similarly, reducing grnd,
but not wgn levels, strongly suppresses Mmp1 expression in RasV12/dlg-RNAi
cells and limits tumour invasion (Extended Data Fig. 8d-g),
indicating that Wgn does not have a major role in the progression of
these tumours.

Perturbation of cell polarity is an early hallmark of tumour progres- 
sion in epithelial cells. In contrast to small patches of polarity-deficient 
cells, for example, scrib−/− clones, organ compartments or animals
fully composed of polarity-deficient cells become refractory to Egr- 
induced cell death and develop epithelial tumours. The formation of 
these tumours requires JNK/MAPK signalling, but not Egr’*"*, sug- 
gesting Egr-independent coupling between loss of polarity and JNK/ 
MAPK-dependent tumour growth. In line with these observations, 
we noticed that, in contrast to Grnd, Egr is not required to drive 
neoplastic growth in rn>avl-RNAi conditions (Extended Data 
Fig. 9a, b). This suggests that, in addition to its role in promoting 
Egr-dependent functions, Grnd couples loss of polarity with JNK- 
dependent growth independently of Egr. Disc immunostainings 
revealed that Grnd co-localizes with the apical determinant Crb 
in the marginal zone, apical to the adherens junction protein 
E-cadherin (E-cad) and the atypical protein kinase C (aPKC; Fig. 4a 
and Extended Data Fig. 9c, d). In avl-RNAi discs, Grnd and Crb 
accumulate in a wider apical domain* (Extended Data Fig. 9e, g). 
Apical accumulation of Crb is proposed to be partly responsible for 
the neoplastic growth induced by avl knockdown, since overexpres- 
sion of Crb or a membrane-bound cytoplasmic tail of Crb (Crb-intra) 
mimics the avl-RNAi phenotype’ (Extended Data Fig. 9f). We there- 
fore examined whether Grnd might couple the activity of the Crb 
complex with JNK-mediated neoplastic growth. Indeed, reducing grnd 
levels, but not wgn, in rn>crb-intra discs suppresses neoplastic growth 
as efficiently as inhibiting the activity of the JNK pathway (Fig. 4c-f 
and Extended Data Fig. 9h, i). Notably, Yki activation is not rescued in 
these conditions (Extended Data Fig. 9j-l), illustrating the ability of 
Crb-intra to promote growth independently of Grnd by inhibiting 
Hippo signalling through its FERM-binding motif (FBM)!””’. 
Indeed, neoplastic growth and polarity defects induced by a form of
Crb-intra lacking its FBM (CrbΔFBM-intra) are both rescued by Grnd
silencing (Fig. 4i, j). As expected, the size of rn>crbΔFBM-intra; grnd-RNAi
discs is reduced compared to the size of rn>crb-intra; grnd-RNAi
discs (compare Fig. 4e, f with Fig. 4j).

Crb, Stardust (Sdt; PALS1 in humans), and Pals1-associated tight
junction protein (Patj) make up the core Crb complex”, which recruits 
the adaptor protein Veli (MALS1-3 in humans)”. In agreement 
with previous yeast two-hybrid data’’, we find that Grnd binds directly 
and specifically to the PDZ domain of Veli through a membrane- 
proximal stretch of 28 amino acids in its intracellular domain 
(Extended Data Fig. 10a-e). Grnd localization is unaffected in 
Figure 4 | grnd is required downstream of the apical Crb polarity complex
for JNK-dependent neoplastic growth. a, b, Transverse sections of wild-type 
discs stained for Grnd (red), Crb (a, green) or Veli (b, green). c-h, Discs 
expressing crb-intra alone (c), or combined with the indicated genotypes 
(d-h), dissected 6 days AED, stained for Wg (green). i, j, Discs expressing 


crb−/− and veli RNAi mutant clones (Extended Data Fig. 10f-h).
However, reducing veli expression rescues the patterning defects and 
disc morphology of rn>crb-intra mutant cells (Fig. 4g, h), suggesting 
that Grnd couples Crb activity with JNK signalling through its inter- 
action with Veli. Interestingly, aPKC-dependent activation of JNK 
signalling’® also depends on Grnd (Fig. 4k, l). aPKC is capable of
directly binding and phosphorylating Crb, which is important for 
Crb function**. This suggests that aPKC, either directly or through 
Crb phosphorylation, activates Grnd-dependent JNK signalling in res- 
ponse to perturbation of apico-basal polarity. 

Our data are consistent with a model whereby Grnd integrates 
signals from Egr, the unique fly TNF, and apical polarity determi- 
nants to induce JNK-dependent neoplastic growth or apoptosis in a 
context-dependent manner (Fig. 4m). Recent work reveals a correla- 
tion between mammalian Crb3 expression and tumorigenic potential 
in mouse kidney epithelial cells”. The conserved nature of the Grnd 
receptor suggests that specific TNFRs might carry out similar func- 
tions in vertebrates, in which the link between apical cell polarity and 
tumour progression remains elusive. 
Figure 4 panel labels: JNK signalling; Apoptosis; Proliferation; grnd RNAi.

Crb-intra lacking its FBM (crbΔFBM-intra) alone (i) or with grnd RNAi
(j) stained for Wg (in green). k, l, Discs from puc-LacZ animals expressing
activated aPKC (aPKCΔN)” in the posterior domain alone (k) or together with
grnd RNAi (l). m, A model for Grnd function.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Fly strains and food. Animals were reared at 25 °C on fly food containing, per 
litre: 17 g inactivated yeast powder, 83 g corn flour, 10 g agar, 60 g white sugar and 
4.6 g Nipagin M (in ethanol). The rn-Gal4, UAS-egr, tubGAL80ts animals were
raised at 18 °C, shifted to 30 °C for 40 h during early third instar larval development
(7 days AEL), and subjected to dissection immediately after (Extended Data
Fig. 6). 

Cell culture. Drosophila S2 and S2R+ cells were grown in complete Schneider's
medium (CSM; Schneider's medium (Invitrogen) supplemented with 10% heat-inactivated
FBS (BioWhittaker), 50 U ml−1 penicillin and 50 µg ml−1 streptomycin
(Invitrogen)) at 25 °C. Transfections were done using Effectene reagent
(Qiagen). 

Plasmids. grnd, grnd-extra and grnd-intra coding sequences were PCR amplified 
from BDGP EST complementary DNA clones RE28509, and cloned into the 
pENTR/D-TOPO vector using the following gene-specific primers: sense primers 
CACCATGTCGGTCAGGAAGTTGAG (grnd and grnd-extra) and CACCA 
TGCTGATCCTCTTGCTGCTGACTATCEC (grnd-intra); and antisense primers 
GAAAGCGACGGGAATCGTCGC (grnd and grnd-intra) and CCATCGAAGG 
AACTGAAACGGG (grnd-extra). The mutations giving rise to the single E202A 
and double P200A/E202A amino acid changes in Grnd and Grnd’?>**" were 
introduced using the QuikChange site-directed mutagenesis kit (Stratagene),
grnd-pENTR or grnd'*>-*"'-pGEX-6PI templates, and the following primers: sense, 
CGGTGAGCACTCGCTATCCGGCGGCAAACAGTACCACGCCCAC; and 
antisense, GTGGGCGTGGTACTGTTTGCCGCCGGATAGCGAGTGCTCACCG
for grndE202A; sense, CGGTGAGCACTCGCTATGCGGCGGCAAACAGTACCACGCCCAC;
and antisense, GTGGGCGTGGTACTGTTTGCCGCCGCATAGCGAGTGCTCACCG
for grndP200A/E202A. To generate grnd-Flag, grndE202A-Flag,
grnd-extra-Flag and grnd-intra-Flag, grnd, grndE202A, grnd-extra and
grnd-intra coding sequences were cloned into the Gateway Destination vector
pActin-3XFlag (Drosophila Gateway Vector Collection) for expression in S2 and S2R+
cells. In addition, grnd, grnd-intra and grnd-extra coding sequences were cloned 
into pUASg.attB for generation of UAS-grnd, UAS-grnd-intra and UAS-grnd- 
extra transgenic lines. The HA-traf2 plasmid was from K.B.””. The HA-egr and 
HA-egrATNF plasmids were a gift from M. Miura. pActin-iHog-HA and pActin- 
iHog-Flag were gifts from L. Ruel. 
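
Site-directed mutagenesis of this kind normally uses a pair of fully complementary primers, so a quick consistency check on the printed sequences is that each antisense primer is the reverse complement of its sense partner. The sketch below performs that check on the two primer pairs quoted above, with line breaks in the printed sequences removed. The helper names (`reverse_complement`, `mismatched_pairs`, `differing_positions`) are illustrative and not taken from the paper.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# (sense, antisense) primer pairs as printed in the Methods, joined across line breaks.
PRIMER_PAIRS = {
    "E202A": (
        "CGGTGAGCACTCGCTATCCGGCGGCAAACAGTACCACGCCCAC",
        "GTGGGCGTGGTACTGTTTGCCGCCGGATAGCGAGTGCTCACCG",
    ),
    "P200A/E202A": (
        "CGGTGAGCACTCGCTATGCGGCGGCAAACAGTACCACGCCCAC",
        "GTGGGCGTGGTACTGTTTGCCGCCGCATAGCGAGTGCTCACCG",
    ),
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def mismatched_pairs(pairs=PRIMER_PAIRS):
    """Names of primer pairs whose antisense is not the reverse complement of the sense."""
    return [name for name, (sense, antisense) in pairs.items()
            if reverse_complement(sense) != antisense]


def differing_positions(seq_a, seq_b):
    """1-based positions at which two equal-length sequences differ."""
    return [i for i, (a, b) in enumerate(zip(seq_a, seq_b), start=1) if a != b]


if __name__ == "__main__":
    print(mismatched_pairs())
    single, double = PRIMER_PAIRS["E202A"][0], PRIMER_PAIRS["P200A/E202A"][0]
    print(differing_positions(single, double))
```

Both printed pairs pass the reverse-complement check, and the two sense primers differ at a single base, which is consistent with the double mutant primer adding one further codon change to the single mutant primer.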

Transgenic flies. To generate the UAS-grnd, UAS-grnd-intra and UAS-grnd-extra
lines, the pUASattB-grnd, pUASattB-grnd-intra and pUASattB-grnd-extra con- 
structs were introduced into the germ line by injections in the presence of the 
PhiC31 integrase and inserted in the 86F8-landing site on the 3R chromosome 
(Bloomington Drosophila Stock Center, BL24749, BestGene). The following RNAi 
lines were from the GD or KK collections of the Vienna Drosophila RNAi Center 
(VDRC): grnd RNAi (GD43454, sense primer CGCGAATTCCCTCTTGCTG 
CTGACTATCCTTCTCGT, and antisense primer GCGTCTAGAGTTGTGG 
GCGTGGTACTGTTTTCC; and KK104538, sense primer TGGCGCCCCTAG 
ATGAACGGTTATGACGGAGATCG, and antisense primer CGCATGTAGC 
CTGCCGTTTGTGGTGATTGCGAATG), tak1 RNAi (KK101357, sense primer 
TGGCGCCCCTAGATGGCTTGTCCTTGAAGTCCTGC, and antisense primer 
CGACGCCCGCTGATAGCATGAGCAGGAGATTGTCA), — bendless RNAi 
(KK109638, sense primer TGGCGCCCCTAGATGCGCATCATCAAGGAGA 
CTCA, and antisense primer CGCATGTAGCCTGCCTTCTGGAAAAACGCC 
GATAC), tab2 RNAi (KK100326, sense primer TGGCGCCCCTAGATGGCA 
CTTTCAGCGGAAGAAAC, and antisense primer TAGCCTCCCTAGCG 
CAGCTCTGCACTGGGATAGGA), traf2 RNAi (GD16125, sense primer CGC 
GAATTCAGCACCCACTCCGCCCAAGA, and antisense primer GCGTCTAG 
AACCGAACAGCCCAGCGAGGA), hep RNAi (KK109277, sense primer TGG 
CGCCCCTAGATGGGCTTCTTGGACAGCTTGAG, and antisense primer CG 
CATGTAGCCTGCCAGGAGAACAAACGCATCCTG), bsk RNAi (KK104569, 
sense primer TGGCGCCCCTAGATGAGCATCCACTTCCTCAGCAT, and 
antisense primer TAGGTCTAGCCCCGCGGTACACCATCACCCTCGTT), 
scrib RNAi (GD 45556, sense primer CGCGAATTCCGAAGAGGACAACG 
CACAGCA, and antisense primer GCGTCTAGACTGGGAGCGGGCAAAGT 
GTAAG), avl (GD5413, sense primer CGCGAATTCCAACAGTAGCGCCAG 
CCAGCA, and antisense primer CGCTCTAGACCGTGCCCTGCGAGACA 
AAA), dilp8 (KK102604, sense primer TGGCGCCCCTAGATGACTCGATG 
GACCTTCTGTCG, and antisense primer CGCATGTAGCCTGCCGTCATCG 
GAGTCTGTTGCCT), and rh3 RNAi (KK100853, sense primer TGGCGCCCC 
TAGATGATCTGGGATGACTTATGGCG, and antisense primer TAGCCTCC 
CTAGCGCTCGATACGCGACTCTTTGTG). The veli-RNAi lines were from 
U. Thomas. The wengen-RNAi line (wengen-IR) was provided by M. Miura. 
The GMR-Gal4 (on chromosome II), UAS-egr (on chromosome III), GMR-Gal4,
UAS-ecto-egr60 (on chromosome II), GMR-Gal4, UAS-egr (on chromosome II),
UAS-crbΔFBM-intra, DIAP-GFP, aPKCΔN, FRT, crb11A22 (null) and
UAS-wgn transgenic lines were previously published; the UAS-puc2A line
was described previously; the UAS-crb-intra transgenic lines and veli®* mutant
flies were from E. Knust; the egr' and egr’ mutant lines and the UAS-rasV12;
FRT82B, scrib1/TM6b and UAS-rasV12, egr’; FRT82B, scrib1/TM6b and yw,
eyFlp1; act>y+>Gal4, UAS-GFP; FRT82B, tubGal80 and yw, eyFlp1;
act>y+>Gal4, UAS-GFP, egr’; FRT82B, tubGal80 lines were from M. Vidal;
the w1118, +; rnGal4, UAS-eiger, tubGal80ts/TM6B line was provided by I.
Hariharan; and the yw, hep75/FM7, GFP; Sal’®/CyO line was provided by F. Serras.
The rn-Gal4, nub-Gal4, puc-LacZ, grndMinos (Mi{MIC}CG10176M?"), and
grnd(BSC149) lines were provided by the Bloomington Drosophila Stock Center.
Fly eye images were acquired on a Leica Fluorescence Stereomicroscope MZ16
FA with a Leica DFC 490 digital camera.
Generation of wgnKO mutant flies. The wgn knockout allele was generated by
ends-in gene targeting as described previously. The pTV2 targeting construct
carried frameshift mutations in exons 1 and 2, an I-SceI site between these two
exons, a white+ marker gene, an I-CreI site, up- and downstream homology
sequences and two flanking FRT sites. The frameshift mutations were introduced
by cutting the unique intrinsic restriction sites AclI and NotI followed by T4 DNA
polymerase-mediated fill-in. F3 larvae carrying the targeting construct together
with an hsp70-flp and an hsp70-I-SceI transgene were heat shocked (1 h at 38 °C on
days 2 and 3 after egg laying) to generate a linearized targeting construct that aligns
and recombines with the endogenous wild-type locus. After crossing the F3 flies to
eye-FLP-expressing flies, the F4 was screened for non-mosaic white+-eyed flies
carrying an insertion of a processed targeting construct. After processing with FLP
and I-SceI, only one FRT site remained next to the white+ marker gene in these
flies. Flies still carrying the original donor insertion lost the white+ marker gene,
since it was still flanked by two FRT sites. Female F4 ‘w+ wgn®’ flies, carrying two
wgn copies on the X chromosome, were crossed with males of the genotype ‘v/Y ;
+/+ ; [v+, 70I-CreI] 1A/TM6’. The F5 offspring were heat shocked (1 h at 38 °C
on day 2 after egg laying). F5 offspring males were selected against TM6 (‘w+/
—7AIY ; +/+ ; [v+, 70I-CreI] 1A/+’) and typically displayed white+/− mosaics.
When they were individually crossed to the X-chromosomal balancer stock ‘bin/
#852’, more than 90% of the males were sterile. The F6 flies that hatched were
either white+ or white−. The white− flies were tested for the presence of
the mutations. The stocks ‘y w (v); P[ry+, 70FLP]4 P[v+, 70I-SceI]2B Sco/S2
CyO’ and ‘v; +/+; [v+, 70I-CreI] 1A/ TM6, Ubx, e’ were provided by K. Golic
and Y. Rong.
Genome-wide screen. As an unbiased approach to identify molecules that are
required for neoplastic growth, we used the elavGal80; rn>avl-RNAi/ TM6b,
tub>Gal80 tester line, which is characterized by neoplastic growth in the discs
and a delay of about 2 days in pupariation. We crossed this line to the Vienna
Drosophila RNAi Center (VDRC) collection of phiC31 (KK) RNAi lines (10,100
lines) and screened for candidates that were able to rescue the delay at pupariation
as described previously. One hundred and twenty-one candidates significantly
rescued the pupariation delay in rn>avl-RNAi larvae. Of these, only 8 lines
rescued the neoplastic disc growth (Extended Data Fig. 1).
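
To put the screen numbers above in proportion, the following short Python sketch computes the fraction of lines retained at each step. It uses only the three counts stated in the text (10,100 lines screened, 121 rescuing the pupariation delay, 8 also rescuing neoplastic growth); the variable and function names are illustrative.

```python
# Sketch: hit rates at each step of the two-step RNAi screen described above.

LINES_SCREENED = 10_100        # KK RNAi lines crossed to the tester line
DELAY_RESCUERS = 121           # lines rescuing the pupariation delay
GROWTH_RESCUERS = 8            # of those, lines also rescuing neoplastic growth


def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


if __name__ == "__main__":
    print(f"delay rescue:  {percent(DELAY_RESCUERS, LINES_SCREENED):.2f}% of screened lines")
    print(f"growth rescue: {percent(GROWTH_RESCUERS, DELAY_RESCUERS):.1f}% of delay rescuers")
    print(f"overall:       {percent(GROWTH_RESCUERS, LINES_SCREENED):.3f}% of screened lines")
```

Roughly 1.2% of the lines passed the first criterion, and fewer than 7% of those passed the second.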
Immunostainings of larval tissues. Tissues dissected from larvae in 1× PBS
(137 mM NaCl, 2.7 mM KCl, 4.3 mM Na2HPO4, 1.47 mM KH2PO4, pH 8) at
the indicated hour AED were fixed in 4% formaldehyde (Sigma) in PBS for
20 min at room temperature, washed in PBS containing 0.1% Triton X-100
(PBT), blocked for 2 h in PBT containing 10% FBS (PBS-TF), and incubated
overnight with primary antibodies at 4 °C. The next day, cells and tissues were
washed, blocked in PBS-TF, and incubated with secondary antibodies at 1/500
dilution (Rhodamine Red-X-conjugated donkey anti-rat, Cy3-conjugated donkey
anti-rabbit, Cy3-conjugated donkey anti-chicken, Cy3-conjugated donkey
anti-mouse and Cy3-conjugated mouse anti-guinea pig, Cy5-conjugated donkey
anti-rat, and Cy5-conjugated donkey anti-guinea pig from Jackson ImmunoResearch,
goat anti-mouse Alexa Fluor 488, donkey anti-rat Alexa Fluor 488 and goat
anti-rabbit Alexa Fluor 488 from Invitrogen) for 2 h at room temperature. After
washing, tissues were mounted in Vectashield containing DAPI for staining of DNA
(Vector Labs). Fluorescence images were acquired using a Leica SP5 DS (×20 and
×40 objectives) and processed using Adobe Photoshop CS5.
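
For readers preparing the dissection buffer, the following Python sketch converts the millimolar PBS composition given above into grams per litre. It assumes anhydrous salts and standard molar masses (NaCl 58.44, KCl 74.55, Na2HPO4 141.96, KH2PO4 136.09 g/mol); hydrated forms would need different masses, and the pH adjustment is not modelled. The function and dictionary names are illustrative.

```python
# Sketch: grams of each salt per litre of 1x PBS, from the stated mM values.
# Assumption: anhydrous salts; molar masses in g/mol.

MOLAR_MASS = {
    "NaCl": 58.44,
    "KCl": 74.55,
    "Na2HPO4": 141.96,
    "KH2PO4": 136.09,
}

PBS_MILLIMOLAR = {
    "NaCl": 137.0,
    "KCl": 2.7,
    "Na2HPO4": 4.3,
    "KH2PO4": 1.47,
}


def grams_per_litre(recipe_mm: dict[str, float]) -> dict[str, float]:
    """Convert a {salt: concentration in mM} recipe to {salt: g per litre}."""
    return {salt: mm / 1000.0 * MOLAR_MASS[salt] for salt, mm in recipe_mm.items()}


if __name__ == "__main__":
    for salt, grams in grams_per_litre(PBS_MILLIMOLAR).items():
        print(f"{salt:8s} {grams:6.3f} g/l")
```

Under these assumptions the recipe works out to about 8.0 g NaCl, 0.2 g KCl, 0.61 g Na2HPO4 and 0.2 g KH2PO4 per litre.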
Lysis and immunoprecipitations. Cells were lysed in 100 µl buffer A (50 mM
Tris-HCl (pH 8), 150 mM NaCl, 1% NP-40, 1 mM EGTA, 0.5 M NaF, phosphatase
inhibitor cocktail 2 (Sigma), complete protease inhibitor cocktail (Roche)).
Immunoprecipitations were performed from about 1 × 10” S2 cells. Cells were
lysed in 200 µl buffer A, and cell extracts were cleared of membranous material by
centrifugation at 10,000 r.p.m. for 15 min. The cleared extracts were incubated
with protein G-Sepharose beads (Sigma) for 1 h to reduce unspecific binding of
proteins to the beads in the subsequent purifications. Next, the precleared extracts
were incubated with 80 µl of protein G-Sepharose beads and 1 µl of the relevant
antibody for 3 h. Subsequently, beads were washed three times in buffer A and boiled
in sample and reducing buffers (Invitrogen).
Protein preparation. Drosophila Veli full length (Veli’’®°) and PDZ domain
(Veli®>"!°) were expressed with an amino-terminal His6-tag from pET43
(Novagen) in Escherichia coli strain BL21 Rosetta, and purified by affinity on
Ni-NTA beads (Qiagen). The Veli L27 domain (Veli! *') was cloned in a modified
pET43 plasmid as a His6-MBP-tagged construct. The fusion protein was
expressed in bacteria similarly to the other constructs. After a first step of affinity
purification on amylose beads (NEB), the protein was incubated with TEV
protease (Invitrogen Life Technologies) to cleave the MBP tag, and loaded on a
Resource-Q anion-exchange column (GE-Healthcare). The TRAF domain of
Drosophila Traf2 (Traf2*!°*”*) was expressed similarly, and purified by affinity
followed by ion-exchange chromatography on a Resource-S column
(GE-Healthcare). All Grnd constructs were expressed as GST-fusion proteins using a
modified version of the pGEX-6P1 vector (GE-Healthcare), and purified by affinity
on glutathione sepharose (GSH) beads. The purifications were carried out at 4 °C.
The PDZ domains of human Par3 (Par3-PDZ2 and Par3-PDZ3 encompassing
residues 465-548 and 582-685, respectively) and Drosophila Bazooka (Baz-PDZ1
corresponding to residues 331-419) were expressed as GST-fusion proteins. After
the first step of affinity purification on GSH-beads, the GST tag was removed from
the constructs by cleavage with PreScission protease, and the PDZ domains were
further purified by ion-exchange chromatography.

In vitro binding assay. To test the interaction properties of intracellular Grnd
constructs, GST-Grnd fusion proteins (5 µM) adsorbed on GSH-beads were
incubated for 40 min at 4 °C with 10 µM VeliFL, VeliL27 or VeliPDZ in a buffer
containing 10 mM HEPES pH 7.4, 30 mM NaCl, 5% glycerol, 0.5 mM EDTA and
1 mM dithiothreitol (DTT). After two washing steps, the bound species were
resolved on SDS-PAGE and visualized by Coomassie staining. To test the
specificity of binding, analogous pull-down assays were performed with the PDZ
domains of Par3 and Bazooka under the same conditions. To monitor the ability
of Traf2 to bind Grnd, 2 µM GST-Grnd125-241, wild type or mutated on P200 and/
or E202, was adsorbed on GSH beads and incubated with 10 µM Traf2?!°~*”° in a
buffer consisting of 10 mM HEPES pH 7.4, 0.15 M NaCl, 5% glycerol, 0.5 mM
EDTA, 1 mM DTT and 0.05% Tween. After washing, complexes immobilized on
beads were analysed by SDS-PAGE.

Subcellular fractioning. Subcellular fractioning of 300 wing imaginal discs dis- 
sected from wild-type animals was performed using the Qproteome Cell 
Compartment Kit (Qiagen) according to the manufacturer’s protocol. 

Western blotting. Proteins were resolved by SDS-PAGE using 4-12% gradient
gels (NuPAGE Novex gel, Invitrogen) and transferred electrophoretically to
polyvinylidene difluoride membranes (Millipore). The membranes were incubated for
1 h in blocking buffer (PBS, 5% milk) and incubated overnight at 4 °C in the same
buffer containing primary antibodies at 1:1,000 dilutions (rabbit
anti-Smoothened, mouse anti-Lamin (Developmental Studies Hybridoma Bank
(DSHB)), mouse anti-Dlg (4F3) from DSHB, mouse anti-Myc 9E10 (sc-40,
Santa Cruz Biotechnology), rabbit anti-Myc A-14 (sc-789, Santa Cruz
Biotechnology), mouse anti-Flag F3165 (Sigma), rat anti-HA 3F10 (Roche)),
guinea pig anti-Grnd at 1:1,000 and anti-tubulin (Sigma) at 1:2,000. The anti-Wgn
antibody was used as described previously. Membranes were washed three times in
PBS-T, blocked for 1 h, and probed with secondary antibodies in blocking buffer
for 1 h at room temperature. After three washes in PBS-T, chemiluminescence was
observed using the ECL-Plus western blotting detection system (Amersham
Biosciences). Images were generated using the Fujifilm Multi Gauge software.
Antibodies. To generate purified antisera specific for the Grnd protein, two
peptides, CAYPHPATANGKDLNA and NAINHPGSDLERAQSQ, corresponding to
amino acids 140-154 and 161-176, respectively, were used as immunogens in
guinea pigs (Eurogentec). The following primary antibodies were used for
immunofluorescence in this study: mouse anti-Wg (4D4) from DSHB 1/50, mouse
anti-Dlg (4F3) from DSHB 1/500, rat anti-E-Cad (DCAD2) from DSHB 1/500, mouse
anti-MMP1 (3A6B4/5H7B11/3B8D12; the three different antibodies were mixed
in equal amounts (1:1:1) from DSHB (1:25)), chicken anti-β-galactosidase from
GeneTex 1/1,000, rabbit anti-cleaved caspase-3 (Asp 175) from Cell Signaling
Technology 1/500, rabbit anti-aPKC from Santa Cruz Biotechnology 1/1,000,
guinea pig anti-Grnd 1/100, rabbit anti-Veli 1/1,000 (gift from G. Pietrini), and
rat anti-Crb 1/1,000 (gift from E. Knust).

Sequence alignment and homology modelling. Multiple sequence alignment,
per cent identity matrix, and phylogenic data were generated using Clustal Omega
(http://www.ebi.ac.uk/Tools/services/web/toolform.ebi?tool=clustalo). Data of
five human death receptors of the TNFR superfamily with the highest identity
with Grnd are shown. Slight manual adjustment of the alignments was performed.
Amino acids are highlighted where Grnd shares identity or similarity with death
receptor(s) (Fig. 1) or Grnd-like proteins from other insect species (Extended Data
Fig. 2d), or where Fas shares identity or similarity with Grnd and/or Wgn
(Extended Data Fig. 2b). Amino acid similarity is presented based on scores in the
Gonnet PAM 250 matrix (>0.5, highly similar; ≤0.5, weakly similar). Amino acid
positions are indicated in parentheses or at the end of the shown sequences. The
phylogram was drawn using interactive Tree of Life (http://itol.embl.de/).
The number on each branch indicates branch length (Extended Data Fig. 2a).
UniProt protein identifiers are: Fas (P25445), DR4 (O00220), DR5 (O14763),
DcR1 (O14798), DcR2 (Q9UBN6), and Wengen (Q9VWS4). Information on
protein domains in TNFR superfamily members used to construct the protein domain
diagram (Extended Data Fig. 2d) was obtained from UniProt
(http://www.uniprot.org/).
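
To make the per cent identity measure used above concrete, here is a minimal Python sketch that computes pairwise identity from two rows of an alignment. It assumes identity is counted over columns where neither sequence has a gap; Clustal Omega's own calculation may differ in detail. The two short peptide strings are invented for illustration and are not sequences from this study.

```python
# Sketch: pairwise per cent identity from two aligned sequences.
# Assumption: columns containing a gap in either sequence are ignored.

def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return the percentage of identical residues over gap-free columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical toy alignment (not real receptor sequences).
    row_a = "CPEGQ-YC"
    row_b = "CPAGTWYC"
    print(f"{percent_identity(row_a, row_b):.2f}% identity")
```

In the toy alignment, seven columns are gap-free and five of them match, giving about 71% identity.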

Genotypes. Fig. 1a: w-; +; rn-Gal4/ puc-lacZ. Fig. 1b: w-; elav-Gal80/+; rn-Gal4,
UAS avl RNAi GD/ puc-lacZ. Fig. 1c: w-; elav-Gal80/ UAS grnd RNAi KK;
rn-Gal4, UAS avl RNAi GD/puc-lacZ. Fig. 1d: w-; elav-Gal80/ UAS tak1 RNAi KK;
rn-Gal4, UAS avl RNAi GD/puc-lacZ. Fig. 1i: w-; nub-Gal4/+; UAS grnd /
puc-lacZ. Fig. 1j: w-; nub-Gal4/+; UAS grnd-intra / puc-lacZ.

Fig. 2a: w-; GMR-Gal4/+; +. Fig. 2b: w-; GMR-Gal4, UAS egr/+; MKRS/+.
Fig. 2c: wgnKO; GMR-Gal4, UAS egr/+; MKRS/+. Fig. 2d: w-; GMR-Gal4, UAS egr/
UAS grnd RNAi KK; MKRS/+. Fig. 2e: w-; GMR-Gal4, UAS egr/ +; MKRS/ UAS
grnd-extra. Fig. 2f: yw, hs-FLP; Act>CD2>Gal4, UAS GFP/+; UAS egr/+. Fig. 2g:
yw, hs-FLP; Act>CD2>Gal4, UAS GFP/ UAS tak1 RNAi KK; UAS egr/+. Fig. 2h-i:
yw, hs-FLP; Act>CD2>Gal4, UAS GFP/ UAS grnd RNAi KK; UAS egr/+.

Fig. 3a: yw, hs-FLP; Act>CD2>Gal4, UAS GFP/+; +. Fig. 3b: yw, hs-FLP;
Act>CD2>Gal4, UAS GFP/ UAS rh3 RNAi KK; +. Fig. 3c: yw, hs-FLP;
Act>CD2>Gal4, UAS GFP/ UAS grnd RNAi KK; +. Fig. 3d: yw, hs-FLP;
Act>CD2>Gal4, UAS GFP/+; UAS scrib RNAi GD/+. Fig. 3e: yw, hs-FLP;
Act>CD2>Gal4, UAS GFP/ UAS rh3 RNAi KK; UAS scrib RNAi GD/+.
Fig. 3f: yw, hs-FLP; Act>CD2>Gal4, UAS GFP/ UAS grnd RNAi KK; UAS scrib
RNAi GD/+. Fig. 3g: yw, ey-FLP; Act>y+>Gal4, UAS GFP; UAS rasV12/+;
FRT82B, scrib1/ FRT82B, tubGal80. Fig. 3h: yw, ey-FLP; Act>y+>Gal4, UAS
GFP; UAS rasV12, UAS grnd RNAi KK /+; FRT82B, scrib1/ FRT82B, tubGal80.
Fig. 3i: yw, ey-FLP; Act>y+>Gal4, UAS GFP; egr' / UAS rasV12, egr’; FRT82B,
tubGal80/ FRT82B, scrib1.

Fig. 4a, b: w-. Fig. 4c: w-; UAS crb-intra/+; rn-Gal4 /+. Fig. 4d: w-; UAS
crb-intra/ UAS puc; rn-Gal4/+. Fig. 4e: w-; UAS crb-intra/ UAS grnd RNAi KK;
rn-Gal4 /+. Fig. 4f: w-; grnd"°/ grnd"; rn-Gal4 / UAS crb-intra. Fig. 4g: w-;
UAS crb-intra/ UAS veli RNAi ; rn-Gal4/ UAS veli RNAi. Fig. 4h: w-; UAS
crb-intra/ nub-Gal4; veli®/ veli®. Fig. 4i: w-; +; UAS crbΔFBM-intra/rn-Gal4. Fig. 4j:
w-; UAS grnd RNAi KK/+ ; UAS crbΔFBM-intra/ rn-Gal4. Fig. 4k: w-; en-Gal4,
UAS GFP/+; UAS aPKCΔN/ puc-lacZ. Fig. 4l: w-; UAS grnd RNAi KK/ en-Gal4,
UAS GFP; UAS aPKCΔN/ puc-lacZ.

Extended Data Fig. 1c: w-; elav-Gal80/+; rn-Gal4, UAS avl RNAi GD /+.
Extended Data Fig. 1d: w-; elav-Gal80/ UAS grnd RNAi GD; rn-Gal4, UAS avl
RNAi GD /+. Extended Data Fig. 1e: w-; elav-Gal80/ Rh3 RNAi KK; rn-Gal4,
UAS avl RNAi GD /+. Extended Data Fig. 1f, g: w-; ptc-Gal4, UAS-GFP/ UAS
grnd RNAi KK; +.

Extended Data Fig. 4a: w-; +; rn-Gal4 /+. Extended Data Fig. 4b: w-; +;
rn-Gal4 / UAS grnd. Extended Data Fig. 4c, d: w-; +; rn-Gal4 / UAS grnd-intra.
Extended Data Fig. 4e: yw; sal-Gal4 /+; UAS grnd-intra /+. Extended Data Fig.
4f: yw, hep75; sal-Gal4 /+; UAS grnd-intra /+.

Extended Data Fig. 5a: w-; GMR-Gal4, UAS egr/+; MKRS/+. Extended Data
Fig. 5b: w-; GMR-Gal4, UAS egr/ UAS-traf2 RNAi GD; MKRS/+. Extended
Data Fig. 5c: w-; GMR-Gal4, UAS egr/ UAS puc2A, MKRS/+. Extended Data
Fig. 5d: w-; GMR-Gal4, UAS egr/ UAS tak1 RNAi KK; MKRS/+. Extended
Data Fig. 5e: w-; GMR-Gal4, UAS egr/ UAS-wengen RNAi IR; MKRS/+.
Extended Data Fig. 5f: w-; GMR-Gal4, UAS egr/ UAS-grnd RNAi GD;
MKRS/+. Extended Data Fig. 5g: wgnKO; GMR-Gal4, UAS egr/ UAS-grnd RNAi
GD; MKRS/+. Extended Data Fig. 5h: wgnKO; +; +. Extended Data Fig. 5i: w-;
GMR-Gal4/ UAS-grnd RNAi KK; +. Extended Data Fig. 5j: w-; GMR-Gal4/+;
UAS-grnd-extra/+. Extended Data Fig. 5k: w-; GMR-Gal4, UAS-ecto-egr60/+;
MKRS/+. Extended Data Fig. 5l: w-; GMR-Gal4, UAS-ecto-egr60/ UAS puc,
MKRS/+.

Extended Data Fig. 6c: w-. Extended Data Fig. 6d: w-; grndMinos/ grndMinos; +.
Extended Data Fig. 6e: w-; +; rn-Gal4, UAS egr, tubGal80ts /+. Extended Data
Fig. 6f: w-; UAS-grnd RNAi KK/+; rn-Gal4, UAS egr, tubGal80ts/+. Extended
Data Fig. 6g: w-; UAS-traf2 RNAi GD /+; rn-Gal4, UAS egr, tubGal80ts/+.
Extended Data Fig. 6h: w-; UAS-tak1 RNAi KK/+; rn-Gal4, UAS egr,
tubGal80ts/+. Extended Data Fig. 6i: w-; grndMinos/ grndMinos; rn-Gal4, UAS
egr, tubGal80ts/+. Extended Data Fig. 6j: w-; grndMinos/ grndDf; rn-Gal4, UAS
egr, tubGal80ts/+.

Extended Data Fig. 7a: yw, ey-FLP; Act>y+>Gal4, UAS GFP; +; FRT82B/
FRT82B, tubGal80. Extended Data Fig. 7b: yw, ey-FLP; Act>y+>Gal4, UAS
GFP; +; FRT82B, scrib1/ FRT82B, tubGal80. Extended Data Fig. 7c: yw, ey-FLP;
Act>y+>Gal4, UAS GFP; UAS grnd RNAi KK /+; FRT82B, scrib1/ FRT82B,
tubGal80. Extended Data Fig. 7d: ey-FLP; Act>CD2>Gal4, UAS GFP/ UAS grnd
RNAi KK ; +. Extended Data Fig. 7e-g: yw, ey-FLP; Act>y+>Gal4, UAS GFP;
UAS rasV12 /+; FRT82B, scrib1/ FRT82B, tubGal80.

Extended Data Fig. 8a: yw, ey-FLP; Act>y+>Gal4, UAS GFP; UAS rasV12/+;
FRT82B, scrib1/ FRT82B, tubGal80. Extended Data Fig. 8b: yw, ey-FLP;
Act>y+>Gal4, UAS GFP; UAS rasV12, UAS grnd RNAi KK /+; FRT82B, scrib1/
FRT82B, tubGal80. Extended Data Fig. 8c: yw, ey-FLP; Act>y+>Gal4, UAS GFP;
UAS rasV12, UAS dilp8 RNAi KK /+; FRT82B, scrib1/ FRT82B, tubGal80.
Extended Data Fig. 8d: w; UAS rasV12, UAS-dlg RNAi GD /+; ey-FLP;
Act>>Gal4, UAS GFP /+. Extended Data Fig. 8e: w; UAS rasV12, UAS-dlg RNAi
GD / UAS-grnd RNAi GD ; ey-FLP; Act>Gal4, UAS GFP /+. Extended Data
Fig. 8f: w; UAS rasV12, UAS-dlg RNAi GD / UAS-grnd RNAi KK ; ey-FLP;
Act>Gal4, UAS GFP /+. Extended Data Fig. 8g: wgnKO; UAS rasV12, UAS-dlg
RNAi GD /+; ey-FLP; Act>>Gal4, UAS GFP /+.

Extended Data Fig. 9a: w-; +; rn-Gal4, UAS avl RNAi GD/+. Extended Data
Fig. 9b: w-; egr’ / egr ; rn-Gal4, UAS avl RNAi GD/+. Extended Data Fig. 9c, d:
w-; +; +. Extended Data Fig. 9e: w-; elav-Gal80/+; rn-Gal4, UAS avl RNAi GD/
+. Extended Data Fig. 9f: w-; UAS crb-intra/+; rn-Gal4 /+. Extended Data Fig.
9g: w-; elav-Gal80/+; rn-Gal4, UAS avl RNAi GD/+. Extended Data Fig. 9h: w-;
UAS crb-intra/+; rn-Gal4 /+. Extended Data Fig. 9i: wgnKO; UAS crb-intra/+;
rn-Gal4/+. Extended Data Fig. 9j: w-; +; rn-Gal4/ diap-GFP. Extended Data Fig. 9k:
w-; UAS crb-intra/+; rn-Gal4 / diap-GFP. Extended Data Fig. 9l: w-; UAS
crb-intra/ UAS-grnd RNAi KK ; rn-Gal4 / diap-GFP.

Extended Data Fig. 10f, g: yw, hs-FLP; +; FRT82B, Ubi-GFP/ FRT82B, crb11A22.
Extended Data Fig. 10h: yw, hs-FLP; Act>CD2>Gal4, UAS GFP/ UAS veli RNAi;
UAS veli RNAi/+.
Extended Data Figure 1 | Identification of Grnd in a genetic screen for
molecules implicated in neoplastic growth. a, Schematic of the genome-wide
screen. Ten thousand one hundred RNAi lines were screened for their ability
to rescue the delay of the rn>avl-RNAi condition. Of the 121 candidates able to
rescue the delay of the rn>avl condition, only 8 also rescued the neoplastic
growth. b, Schematic of the five components of the JNK pathway shown to be
required for the neoplastic overgrowth in the avl loss-of-function condition.
Although no RNAi line targeting Traf2 was included in the original screen,
Traf2 was subsequently shown to be required for neoplastic overgrowth in the
avl-RNAi condition using an alternative RNAi line. c-e, Wing discs of the
indicated genotypes dissected 5 days (left) or 6 days (right) AED and stained for
Wg (green) and Caspase3 (red). f, g, Transverse (f) or xy (g) sections of
dissected wing discs expressing grnd RNAi in the patched (ptc) domain (GFP, green)
stained for Grnd (red) and DAPI (g, right, blue) to visualize the nuclei.
h, Subcellular fractionation of cleared extract from dissected wing discs shows
that Grnd, like the transmembrane receptor Smoothened, localizes to the
membrane fraction (PM), whereas Discs large remains in the cytoplasmic
fraction (Cyto). Lamin serves as a control for nuclear fractions.
Percentage identity matrix
                  Wengen   Grnd     Fas      DcR1     DcR2     DR4      DR5
Wengen (89-145)   100.00   14.00    21.82    14.04    14.29    17.54    14.04
Grnd (29-81)       14.00  100.00    26.00    22.00    20.00    17.65    19.61
Fas                21.82   26.00   100.00    33.93    36.36    39.29    32.14
DcR1               14.04   22.00    33.93   100.00    61.40    60.00    60.34
DcR2               14.29   20.00    36.36    61.40   100.00    58.62    65.52
DR4                17.54   17.65    39.29    60.00    58.62   100.00    79.66
DR5 (128-186)      14.04   19.61    32.14    60.34    65.52    79.66   100.00
Extended Data Figure 2 | Sequence homology of the extracellular domain of
Grnd to CRDs of human TNFRs. a, Per cent identity matrix and phylogenic
data from multiple local sequence alignment of the entire extracellular domains
of Grnd and Wgn with extracellular domains (corresponding to a part of CRD2
and all of CRD3) of various human TNFR superfamily members. The
extracellular domain of Grnd shares the highest identity with the
GBM-harbouring CRD of Fas (26%). A similar result is observed for Wgn, although it
exhibits lower identity (22%) and a higher degree of divergence to Fas.
b, Sequence alignment of Fas, Grnd and Wgn demonstrates that Grnd shares a
higher degree of similarity with the GBM-harbouring CRD of Fas than Wgn.
This is particularly apparent in the immediate region encompassing the
functionally critical F134 (marked with an asterisk) of the GBM of Fas. Blue
indicates residue identity, yellow strong similarity, and grey weak similarity of
Fas with Grnd and/or Wgn. c, Schematic representation of Grnd, Wgn, and
various members of the human TNFR superfamily showing a high level of
diversity in the number and arrangement of CRDs. The colour codes used are:
red for signal peptide, green for extracellular domains, orange for
transmembrane domains, and blue for intracellular domain. Numbers in each
box indicate corresponding amino acid positions. Pro, prodomain; T1-5,
TAPE1-5 (threonine-, alanine-, proline- and glutamine-rich repeats).
d, Multiple sequence alignment showing the conservation of the extracellular
domain of Grnd in other insect species. Whereas global similarity of the
extracellular domain of Grnd degenerates outside the melanogaster group, the
cysteine residues of the CRD (highlighted in red) and the aromatic residue (F/Y,
marked with an asterisk) corresponding to F134 of Fas CRD3 remain highly
conserved throughout the Hymenoptera order. Blue indicates residue identity;
yellow, strong similarity; and grey, weak similarity to Grnd.
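
The statement in panel a that Fas is the closest human receptor to both Grnd and Wgn can be read directly off the percentage identity matrix. The Python sketch below encodes the matrix values as printed, checks that the matrix is symmetric, and reports the best-matching human receptor for each fly protein. The data layout and function names are illustrative choices, not part of the original analysis.

```python
# Sketch: look up the closest human death receptor for Grnd and Wengen
# in the percentage identity matrix of Extended Data Fig. 2a.

NAMES = ["Wengen", "Grnd", "Fas", "DcR1", "DcR2", "DR4", "DR5"]
HUMAN = ["Fas", "DcR1", "DcR2", "DR4", "DR5"]

MATRIX = [
    [100.00, 14.00, 21.82, 14.04, 14.29, 17.54, 14.04],
    [14.00, 100.00, 26.00, 22.00, 20.00, 17.65, 19.61],
    [21.82, 26.00, 100.00, 33.93, 36.36, 39.29, 32.14],
    [14.04, 22.00, 33.93, 100.00, 61.40, 60.00, 60.34],
    [14.29, 20.00, 36.36, 61.40, 100.00, 58.62, 65.52],
    [17.54, 17.65, 39.29, 60.00, 58.62, 100.00, 79.66],
    [14.04, 19.61, 32.14, 60.34, 65.52, 79.66, 100.00],
]


def is_symmetric(matrix: list[list[float]]) -> bool:
    """Return True if matrix[i][j] == matrix[j][i] for all i, j."""
    n = len(matrix)
    return all(matrix[i][j] == matrix[j][i] for i in range(n) for j in range(n))


def closest_human_receptor(query: str) -> tuple[str, float]:
    """Return the human receptor with the highest identity to the query protein."""
    row = MATRIX[NAMES.index(query)]
    best = max(HUMAN, key=lambda name: row[NAMES.index(name)])
    return best, row[NAMES.index(best)]


if __name__ == "__main__":
    print("symmetric:", is_symmetric(MATRIX))
    for fly_protein in ("Grnd", "Wengen"):
        receptor, identity = closest_human_receptor(fly_protein)
        print(f"{fly_protein}: closest is {receptor} ({identity:.2f}%)")
```

The rounded values (26% for Grnd and 22% for Wgn) match those quoted in the legend.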
Extended Data Figure 3 | Identification of a conserved TRAF6-like binding
motif in the intracellular domain of Grnd proteins. Multiple sequence
alignment of the intracellular domain of Grnd and Grnd-like proteins in other
species. A binding motif for TRAF6 (PxExx(aromatic/acidic residue)), the
closest homologue of Drosophila Traf2, was previously identified in human
TNFRs. The position of a related putative Traf2-binding motif in the
intracellular domain of Grnd is indicated. Note that, although sequence
similarity of the intracellular domain of Grnd degenerates outside the family
Drosophilidae, the core residues of this motif (highlighted in red) remain highly
conserved throughout the order Hymenoptera. Blue indicates residue identity;
yellow, strong similarity to Grnd.
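
The PxExx(aromatic/acidic residue) consensus described above can be expressed as a simple pattern search. The Python sketch below is one possible encoding, assuming that "aromatic" means F, W or Y and "acidic" means D or E, and that x is any residue. The test peptides are invented for illustration and are not Grnd sequences; the function name is hypothetical.

```python
# Sketch: scan a protein sequence for the TRAF6-type consensus PxExx[aromatic/acidic].
# Assumptions: aromatic = F, W, Y; acidic = D, E; x = any residue.
import re

# Lookahead makes overlapping matches visible to finditer.
TRAF6_LIKE = re.compile(r"(?=(P.E..[FWYDE]))")


def find_traf6_like_motifs(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched hexapeptide) for each motif hit."""
    return [(m.start() + 1, m.group(1)) for m in TRAF6_LIKE.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical peptides: intact motif, then the same peptide with E changed to A.
    for peptide in ("AAPTEDSYAA", "AAPTADSYAA"):
        print(peptide, find_traf6_like_motifs(peptide))
```

A P-x-E spacing of this kind is consistent with the P200 and E202 positions mutated in the Methods, which lie two residues apart.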
Extended Data Figure 4 | Grnd acts upstream of JNK signalling. a-d, Wild-type
wing discs (a) or discs expressing grnd (b) or grnd-intra (c, d) in the rn
domain were dissected 5 days AED and stained for Wg (left; green), Cas3
(middle; red), and Grnd (right; grey). d, e, Wing discs expressing grnd-intra in
the sal domain (marked by a truncated line) in a wild-type (d) or hep75 mutant
(e) background were dissected 5 days AED and stained for Wg (left; green),
Cas3 (middle; red), and Grnd (right; grey).
Extended Data Figure 5 | Characterization of the wgnKO mutant allele.
a-e, Light micrographs of Drosophila adult eyes are shown. Original
magnification, ×30. The small-eye phenotype caused by GMR-driven
expression of egr (a) is rescued by co-expressing puc (c) or reducing the activity
of Traf2 (b), but not by reducing Wgn activity (d; also serves as a control to
exclude a Gal4 titration effect). f, g, Reducing Grnd activity partially rescues the
small-eye phenotype, giving rise to the hanging-eye phenotype (f) and is not
further rescued in a wgnKO mutant background (g). h, wgnKO mutant flies do
not display any eye phenotype. i, j, Reducing Grnd levels or expressing
Grnd-extra in the GMR domain does not affect eye morphology. k,
GMR-Gal4-induced expression of a soluble form of Egr lacking the transmembrane and
cytoplasmic domain (Ecto-Egr-60) gives rise to a small-eye phenotype.
l, Co-expression of puc2A in a GMR>ecto-egr-60 background partially rescues the
small-eye phenotype, giving rise to a hanging-eye phenotype. m, Sequence
analysis of DNA extracted from flies homozygous for the wgnKO mutation
showing the two frameshift mutations introduced by cutting the unique
intrinsic restriction sites AclI (exon 1) and NotI (exon 2) followed by T4 DNA
polymerase-mediated fill-in. n, Western blot of fly extracts from the indicated
genotypes probed with an anti-Wgn antibody. KO, knockout; OE,
overexpressing; WT, wild type.
Extended Data Figure 6 | Characterization of the grndMinos mutant.
a, Schematic representation of the Minos transposon insertion in the grnd gene
region corresponding to the 5′ untranslated region of the transcript. b, Western
blot of wing disc extracts from the indicated genotypes probed with anti-Grnd
(top) and anti-β-tubulin (bottom) antibodies. Grnd protein migrates at the
predicted size of 27 kDa (lane 1, control). Grnd protein levels are decreased
below detection level in both grndMinos/Minos and grndMinos/Df mutant flies,
indicating that grndMinos/Minos is indistinguishable from a null mutant.
c, d, Polarity is not affected in grndMinos/Minos and grndMinos/Df wing discs.
Transverse sections of dissected control (c) or grndMinos/Minos (d) wing discs
stained for Grnd (top; red), Crb (middle top; blue) and Dlg (middle bottom;
green). e-j, Wing discs dissected from third instar larvae after 40 h of Egr
expression in the rn domain. Pouch ablation, ubiquitous Wg expression and
apoptosis observed in rn>egr, Tub-Gal80ts discs (e) are suppressed to various
extents upon co-expression of grnd-RNAi (f) or reduction of JNK activity
(g, h) and fully suppressed in grndMinos/Minos (i) and grndMinos/Df (j) mutant
backgrounds.
Extended Data Figure 7 | Grnd is highly expressed in RasV12/scrib−/−
micro-metastases. a-c, Reducing grnd levels in scrib−/− mutant clones
prevents their elimination. Eye discs dissected 5 days AED carrying
GFP-labelled MARCM clones of the indicated genotypes stained for Grnd (middle;
red) and DAPI (right; grey). d, Reducing Grnd levels does not affect cell
viability. Eye discs dissected 5 days AED carrying GFP-labelled grnd RNAi
Flip-out clones stained for Grnd (middle; red) and DAPI (right; grey). e-g, Invasive
brain tumours and micro-metastases display high levels of Grnd protein.
Dissected eye-brain complex (e), gut (f) and fat body (g) displaying
GFP-labelled invasive RasV12/scrib−/− clones (e; green) or micro-metastases
(f, g; green) stained for Grnd (red) and DAPI (blue).
Extended Data Figure 8 | Grnd, but not Wgn, is required for Mmp1
expression both in RasV12/scrib−/− and RasV12/Dlg RNAi tumours. a-c,
Eye-brain complexes dissected 7 days AED carrying RasV12/scrib−/− (a),
RasV12/scrib−/− + grnd-RNAi (b), or RasV12/scrib−/− + dilp8-RNAi (c; control RNAi
to exclude a Gal4 titration effect) clones labelled by GFP and stained for Mmp1
(middle; red). RasV12/scrib−/− and RasV12/scrib−/− + dilp8-RNAi clones
invade the ventral nerve cord (a, c; see white arrow). d-g, Eye-brain complexes
dissected 7 days AED carrying RasV12/Dlg-RNAi (d) or RasV12/Dlg-RNAi +
grnd-RNAi (e, f) clones (labelled by GFP), or RasV12/Dlg-RNAi GFP-labelled
clones in a wgn-null mutant background (g) stained for Mmp1 (middle; red)
and DAPI (right; blue). RasV12/Dlg RNAi clones invade the ventral nerve cord
in wgn-null mutant animals (g; see white arrow).
Extended Data Figure 9 | Egr is not required for the rn>avl RNAi-induced
neoplastic growth. a, b, Wing discs of the indicated genotypes dissected 6 days
AED and stained for Wg (green). c, d, Transverse sections of dissected
wild-type discs stained for Grnd (red) and either E-cad (c; in green), or aPKC (d; in
green). e, g, Transverse sections of rn>avl RNAi wing discs dissected 5 days
AED and stained for Grnd (top, bottom; red) and either E-cad (e, middle,
bottom; green) or Crb (g, middle, bottom; green). The rn expression domain is
indicated (e, top). f, Wing discs expressing crb-intra in the rn domain
(rn>crb-intra) dissected 6 days AED and stained for Discs large (Dlg, green) to visualize
neoplastic structures and Grnd (red). h, i, Wing discs expressing crb-intra in the
rn domain of wild-type (h) or wgn-knockout mutant (i) animals dissected 6
days AED and stained for Wg (green). j-l, Reducing grnd levels does not affect
crb-intra-mediated inhibition of Hippo signalling, which causes an
upregulation of the Yki transcriptional target Diap1. Control discs (j) or wing
discs expressing Crb-intra alone (k) or together with grnd RNAi (l) in the rn
domain were dissected from animals carrying a reporter gene for Hippo
pathway activity (Diap-GFP, green) and stained for Wg (red).
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Extended Data Figure 10 | Grnd directly interacts with the PDZ domain of 
Veli and neither depends on Veli nor Crb for its proper localization. a, b, 
Schematic representation of Grnd and Veli (also known as Lin-7) domains and 
the various truncations used to map the interface of the Grnd-Veli interaction. 
L27, Lin-2/Lin-7 domain; PDZ, PSD-95/Dlg/ZO-1 domain; 
TM, transmembrane domain. c, GST-Grnd!*>*“! immobilized on GSH beads 
was incubated with full-length Veli (Veli-FL) or the complementary Veli 
fragments L27 (Veli-L27) and PDZ domain (Veli-PDZ). The PDZ1 domain of the 
polarity protein Bazooka was included in the pull-down as a specificity control 
(Baz-PDZ1). Coomassie-blue stained SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE) was used to visualize species retained on beads. Veli-FL and 
Veli-PDZ bind to Grnd (lanes 10 and 12), but not Veli-L27 nor Baz-PDZ1 (lanes 11 
and 9), indicating that Grnd specifically recognizes the PDZ domain of Veli. 
d, To map the interface between Grnd and Veli-PDZ, the same binding assay was 
performed using a battery of Grnd deletion mutants adsorbed on GSH beads. 
Veli?” forms a complex with Grnd’?>*4! and Grnd'?°"1® (lanes 1 and 3), but 
not with the carboxy-terminal region of Grnd (lanes 2 and 4) or GST alone 
(lane 5, control). e, To address further the specificity of the binding of Grnd!??>- 
41 to the PDZ domain of Veli, we repeated the pull-down assay with the PDZ 
domains of Baz and its human orthologue PAR3. Veli’ ”” is retained on GST- 
Grnd!*>-**! beads (lane 9), while PAR3PD22, PAR3?P23 and Baz’ >”! are not 
(lanes 10, 11 and 12). f-g, Reducing Crb levels affects the localization of Veli, 
but not Grnd. f-g, xy (f) or transverse (g) sections of dissected wing discs 
bearing crb-/- mutant clones (labelled by absence of GFP) stained for Grnd 
(f, bottom left, g, bottom; white) or Veli (f, bottom right; red). h, Reducing Veli 
levels does not affect Crb or Grnd localization. Transverse sections of 
dissected wing discs bearing clones expressing veli RNAi (h, bottom; labelled by 
the presence of GFP) stained for Grnd (top; red) and Crb (middle; blue). 
Viraemia suppressed in HIV-1-infected humans 
by broadly neutralizing antibody 3BNC117 

Marina Caskey, Florian Klein, Julio C. C. Lorenzi, Michael S. Seaman, Anthony P. West Jr, Noreen Buckley, Gisela Kremer, 
Lilian Nogueira, Malte Braunschweig, Johannes F. Scheid, Joshua A. Horwitz, Irina Shimeliovich, Sivan Ben-Avraham, 
Maggi Witmer-Pack, Martin Platten, Clara Lehmann, Leah A. Burke, Thomas Hawthorne, Robert J. Gorelick, 
Bruce D. Walker, Tibor Keler, Roy M. Gulick, Gerd Fätkenheuer, Sarah J. Schlesinger & Michel C. Nussenzweig 


HIV-1 immunotherapy with a combination of first generation 
monoclonal antibodies was largely ineffective in pre-clinical and 
clinical settings and was therefore abandoned. However, recently 
developed single-cell-based antibody cloning methods have uncov- 
ered a new generation of far more potent broadly neutralizing 
antibodies to HIV-1 (refs 4, 5). These antibodies can prevent infec- 
tion and suppress viraemia in humanized mice and nonhuman 
primates, but their potential for human HIV-1 immunotherapy 
has not been evaluated. Here we report the results of a first-in-man 
dose escalation phase 1 clinical trial of 3BNC117, a potent 
human CD4 binding site antibody, in uninfected and HIV-1-infected 
individuals. 3BNC117 infusion was well tolerated and 
demonstrated favourable pharmacokinetics. A single 30 mg kg⁻¹ 
infusion of 3BNC117 reduced the viral load in HIV-1-infected 
individuals by 0.8-2.5 log₁₀ and viraemia remained significantly 
reduced for 28 days. Emergence of resistant viral strains was vari- 
able, with some individuals remaining sensitive to 3BNC117 for a 
period of 28 days. We conclude that, as a single agent, 3BNC117 is 
safe and effective in reducing HIV-1 viraemia, and that immu- 
notherapy should be explored as a new modality for HIV-1 pre- 
vention, therapy and cure. 

A fraction of HIV-1-infected individuals develop potent neutralizing 
serologic activity against diverse viral isolates. Single-cell cloning 
methods to isolate antibodies from these individuals revealed that 
broad and potent neutralization can be achieved by antibodies targeting 
many sites on the viral envelope. Many of these antibodies can 
prevent infection, and some can suppress active infection in humanized 
mice (hu-mice) or macaques. Therefore, it is generally 
accepted that a vaccine eliciting such antibodies is likely to be protective 
against HIV-1. However, potent anti-HIV-1 broadly neutralizing 
antibodies (bNAbs) are highly somatically mutated and many carry 
other uncommon features such as insertions, deletions, or long 
complementarity-determining regions, which may account for the 
difficulty in eliciting such antibodies by immunization. In view of the 
efficacy of passive bNAb administration in hu-mice and macaques, 
it has been suggested that bNAbs should be administered 
passively, or by viral vectors, for prevention and immunotherapy. 
However, their safety and efficacy have not been tested in humans. 

To determine whether the new generation of more potent bNAbs are 
safe and active against HIV-1 in humans, we initiated an open-label 
phase 1 study (Fig. 1a) with 3BNC117, an anti-CD4 binding site antibody 
cloned from a viraemic controller. 3BNC117 neutralizes 195 
out of 237 HIV-1 strains comprising 6 different clades with an average 
half-maximal inhibitory concentration (IC₅₀) of 0.08 µg ml⁻¹ (Extended 
Data Fig. 1). Twelve uninfected and 17 HIV-1-infected individuals (Table 1) 
were administered a single intravenous dose of 1, 3, 10 or 30 mg kg⁻¹ of 
3BNC117 (Extended Data Table 1a). 3BNC117 serum concentrations, 
plasma HIV-1 viral loads (VL), CD4⁺ and CD8⁺ T-cell counts, and 
safety were monitored closely (Fig. 1a, Extended Data Figs 2, 3, and 
Extended Data Tables 1b, 2). The two groups were comparable for 
gender, race and age (Table 1). 

3BNC117 was generally safe and well tolerated at all doses tested in 
both uninfected and HIV-1-infected individuals. No grade 3, 4 or 
serious adverse events and no treatment-related laboratory changes 
were observed during 56 days of follow up (Extended Data Table 1b). 
CD4⁺ or CD8⁺ T-cell counts did not change after 3BNC117 infusion 
in the HIV-1-infected group, possibly because initial CD4⁺ T-cell 
counts were near normal in most participants (mean absolute CD4⁺ 
T-cell count was 655 cells per µl, Extended Data Fig. 2). 

Two different assays were used to measure 3BNC117 levels in 
serum: TZM.bl neutralization assay to measure activity, and anti-idio- 
type-specific ELISA to measure antibody protein levels (Fig. 1b, 
Extended Data Fig. 3 and Extended Data Tables 4, 5). With few excep- 
tions the two assays were generally in agreement in both groups 
(Fig. 1b and Extended Data Fig. 3). However, elimination of 
3BNC117 activity was more rapid in the HIV-1-infected group, resulting 
in an estimated average t₁/₂ of around 9 days as opposed to around 
17 days in uninfected individuals (Fig. 1b and Extended Data 
Tables 4, 5). We conclude that 3BNC117 has pharmacokinetic prop- 
erties consistent with a typical human IgG1 in uninfected individuals 
and a somewhat faster decay rate in HIV-1-viraemic individuals. 
Similar antigen-dependent enhanced clearance has been reported with 
anti-cancer antibodies. Although there may be other explanations, we 
speculate that the increased rate of antibody elimination in the presence 
of HIV-1 is due to accelerated clearance of antigen-antibody complexes. 
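
The half-life comparison above rests on the standard first-order relation t₁/₂ = ln 2 / k, where k is the elimination rate constant. The following is a minimal sketch of that calculation, assuming hypothetical, noise-free serum concentrations (not study data) and an ordinary least-squares fit of ln(concentration) against time; the function name is illustrative and is not taken from the study's analysis software.

```python
import math

def half_life_days(times, concentrations):
    """Estimate t1/2 from a least-squares line through ln(concentration) vs time."""
    logs = [math.log(c) for c in concentrations]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    slope = (sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
             / sum((t - mean_t) ** 2 for t in times))
    # The slope is -k for a decaying signal, so t1/2 = ln 2 / k.
    return math.log(2) / -slope

# Hypothetical example: a concentration that halves every 9 days.
days = [0, 7, 14, 21, 28]
conc = [100 * 0.5 ** (d / 9) for d in days]
print(round(half_life_days(days, conc), 2))
```

Real serum data are noisy and are usually fitted only over the terminal phase, so the choice of time points would matter in practice.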

Viral loads were measured by standard assays or by single-copy 
assays. Baseline VLs in HIV-1-infected individuals not on anti-retroviral 
therapy (ART) varied from 640 to 53,470 copies ml⁻¹ (mean 9,420 
copies ml⁻¹). Two participants were on ART at the time of antibody 
infusion but had detectable baseline VLs (30 and 100 copies ml⁻¹) 
(Table 1, Extended Data Table 2a). 

Virologic responses correlated with antibody dose. Infected individuals 
receiving 1 or 3 mg kg⁻¹ 3BNC117 doses showed only small and 
transient changes in viraemia consisting of increases of up to threefold 
1 day after infusion, followed by a short temporary decrease, and rapid 
return to baseline (Fig. 2, Extended Data Fig. 4, and Extended Data 
Figure 1 | Pharmacokinetics of 3BNC117 in uninfected and HIV-1-infected 
individuals. a, Diagrammatic representation of the study. Time of 3BNC117 
infusion indicated by the red arrow, and sampling for 3BNC117 serum 
levels, HIV-1 viral load, CD4⁺/CD8⁺ T cell counts and env sequencing as 
indicated below. b, Antibody decay measured in TZM.bl assays (solid lines) and 
ELISA (dotted lines). Mean values and s.e.m. for uninfected individuals (3 per 
group) are shown in blue and for HIV-1-infected individuals (2-5 per group) 
in red. Light grey indicates lower level of accuracy by the ELISA assay and 
dark grey by the TZM.bl assay. Open circles indicate levels lower than the 
accuracy threshold. 


Table 2a). The magnitude and kinetics of the initial increase in viraemia 
were consistent with those seen with viral entry inhibitors. 

In contrast, 10 out of 11 individuals receiving 10 or 30 mg kg⁻¹ infusions 
responded by dropping their VLs by up to 2.5 log₁₀ (Fig. 2, 
Extended Data Fig. 4 and Extended Data Table 2a). Two individuals 
off ART received the 10 mg kg⁻¹ dose, of whom 1 responded with 1.36 


Table 1 | Study participants demographics 

                                 Uninfected (n = 12)    HIV-1-infected (n = 17) 
Gender (% male)                  83%                    76% 
Mean age (range)                 43 (22-58)             37 (20-54) 
Race/ethnicity 
  White                          42%                    29% 
  Black or African American      50%                    53% 
  Hispanic                       8%                     18% 
ART status 
  On ART n (%)                   -                      2 (12%) 
  Off ART n (%)                  -                      15 (88%) 
Mean absolute CD4⁺ count 
  (cells µl⁻¹; day 0)            -                      655 (245-1,129) 
Mean % CD4⁺ count (day 0)        -                      29% (20-42%) 
Mean HIV-1 RNA level 
  (copies ml⁻¹; day 0)*          -                      9,420 (640-53,470) 

* Mean HIV-1 RNA levels in HIV-1-infected participants off ART. 
log₁₀ decline in viraemia and the other did not (Fig. 2 and Extended Data 
Table 2a). The individual that did not respond was infected with a virus 
that was completely resistant to 3BNC117 (2C4; IC₅₀ > 20 µg ml⁻¹, 
Fig. 3 and Extended Data Table 3). All 8 individuals that received the 
30 mg kg⁻¹ dose of 3BNC117 showed highly significant and rapid 
decreases in their viral loads that varied between −0.8 and −2.5 log₁₀ 
(Figs 2 and 3 and Extended Data Table 2a, b). The magnitude of the 
decline was related to the starting VL and the sensitivity of the subjects’ 
autologous virus to 3BNC117 (Figs 2 and 3, Extended Data Fig. 5). The 
median time to reach the lowest level in viraemia was 7 days, and the 
mean drop in VL was 1.48 log₁₀ at lowest level. When compared to all 
available pre-treatment measurements, the drop in viraemia was highly 
significant from days 4 through 28 (Fig. 2 and Extended Data Table 2b). 
Although the limited data set does not allow us to determine viral set 
point, 4 of the 8 individuals receiving a single 30 mg kg⁻¹ infusion did 
not entirely return to day 0 pre-infusion levels during the observation 
period of 56 days (Figs 2 and 3, Extended Data Table 2a). 
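
The declines above are expressed as log₁₀ changes relative to baseline viral load. As a rough illustration of the arithmetic only, the sketch below converts a pair of viral load measurements into a log₁₀ change and back into a fold reduction. The participant values are hypothetical and the function names are illustrative; the study's own comparison used all available pre-treatment measurements rather than a single baseline.

```python
import math

def log10_change(baseline_copies_per_ml, followup_copies_per_ml):
    """Log10 change in viral load; negative values indicate a decline."""
    return math.log10(followup_copies_per_ml / baseline_copies_per_ml)

def fold_reduction(log_change):
    """Convert a log10 change into a fold reduction (e.g. -2 -> 100-fold)."""
    return 10 ** (-log_change)

# Hypothetical participant: 10,000 copies per ml at day 0, 330 at nadir.
change = log10_change(10_000, 330)
print(f"{change:.2f} log10, about {fold_reduction(change):.0f}-fold lower")
```

On this scale, the reported mean drop of 1.48 log₁₀ corresponds to roughly a 30-fold reduction in plasma HIV-1 RNA.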

To further examine the virologic effects of 3BNC117 immunotherapy, 
autologous viral isolates were obtained from cultured PBMCs before 
(day 0, day —7) and after (day 28) antibody infusion. Paired samples 
from 12 of the 17 HIV-1-infected individuals were tested for 3BNC117 
sensitivity (Fig. 3 and Extended Data Table 3). Samples obtained from 
individuals infused with 1 mg kg⁻¹ showed 35- and 13.5-fold decreases in 
3BNC117 sensitivity, indicating that the antibody exerts selective pressure 
on HIV-1 even at the lowest dose (2A3, 2A4; Fig. 3). Similar changes in 
sensitivity were seen for some (2B1, 2C5) individuals treated with 3 and 
10 mg kg⁻¹, but others remained 3BNC117-sensitive throughout (2B3) 
(Fig. 3, Extended Data Table 3). Similarly, at 30 mg kg⁻¹, only 2 out of 5 
individuals tested showed greater than fivefold reduction in 3BNC117 
sensitivity on day 28 (Extended Data Table 3). In contrast, 2C1, 2D1, 2D3 
showed only 3.2-, 1.3- and 2.7-fold changes in sensitivity and these indi- 
viduals did not rebound to baseline viraemia levels at day 28 (Figs 2 and 3). 
We conclude that, in some individuals, HIV-1 develops high-level resist- 
ance to 3BNC117 by 28 days after a single dose, while in others it does not. 

To examine the molecular nature of the changes in HIV-1 in res- 
ponse to 3BNC117, we cloned and sequenced HIV-1 envelopes from 
paired plasma samples from 10 individuals before and 28 days after 
infusion. Evidence for antibody-induced selection was seen in some 
but not all samples analysed (Fig. 4, Supplementary Fig. 1). For example, 
2C5, who received a 10 mg kg⁻¹ infusion, selected for a G459D mutation 
in 15 out of 23 env sequences, while the remainder showed a longer 
V5 loop. The G459D mutation alters the CD4 binding site and can 
result in resistance to 3BNC117 (ref. 9). Changes in the V5 loop can 
alter sensitivity to anti-CD4 binding site antibodies by steric clashing 
with the heavy or light chains of 3BNC117-type antibodies. Similarly, 10 
or 30 mg kg⁻¹ infusions selected single mutations at Q363H (2E1), 
S461D (2E2), and S274Y (2E2) (Fig. 4, Supplementary Fig. 1). These 
changes may alter sensitivity to 3BNC117 by interfering with binding. 
Selection in these 3 individuals is also indicated by the emergence of a 
distinct group of closely related sequences in phylogenetic trees (Fig. 4, 
Supplementary Fig. 1). Consistent with the molecular analysis, and the 
viral culture data (Fig. 3), pseudoviruses produced from serum of 2C5 
from days 0 and 28 showed high level 3BNC117 resistance, whereas the 
changes in pseudoviruses produced from 2C1, 2D1, 2E1 and 2E2 were 
modest (Fig. 4 and Extended Data Table 3). In contrast, autologous viral 
isolates from individuals who did not become resistant, or had only 
small changes in sensitivity, such as 2B3, showed little if any evidence of 
selection. We conclude that a single infusion of 3BNC117 leads to 
selection for high-level resistance in some but not all individuals. 

Although immunotherapy was initially used to treat infectious diseases, 
the great majority of therapeutic monoclonal antibodies are currently 
used to treat cancer and autoimmune diseases. This form of therapy 
has been shown to be highly effective, well tolerated, and to function in 
large part by engaging the host immune system through Fc receptors. 

In contrast, a role for antibodies in controlling HIV-1 infection has 
been difficult to establish. For example, the overall course of infection is 
Figure 4 | HIV-1 envelope sequence analysis after 3BNC117 infusion. 

a, HIV-1 envelopes were cloned from plasma samples. Logogram showing env 
gp120 regions (amino acid positions; 270-285, 360-371, and 455 to 471-485, 
according to HXBc2 numbering) indicating sequence changes from day 0 (blue 
bar) to day 28 (red bar). The frequency of each amino acid is indicated by its 
height. Red residues represent mutations that were only found after treatment, 
black residues represent amino acids that changed in frequency after treatment, 
empty boxes represent gaps, and + symbols represent 3BNC117 contact sites 
on gp120 (ref. 30). b, Phylogenetic trees show gp120 evolution from day 0 to 28 


not thought to be altered in individuals that develop bNAbs. Moreover, 
first generation anti-HIV-1 bNAbs with limited breadth and activity 
produced little if any measurable effects in hu-mice or viraemic individuals. 
However, antibodies can put strong selective pressure on the 
virus in individuals that develop anti-HIV-1 antibody responses. In 
addition, recent studies in hu-mice showed that, when administered as 
monotherapy, new generation bNAbs can transiently reduce VLs, and 
in combination they control viraemia for as long as concentrations 
remain in the therapeutic range. In contrast, single antibodies led to 
control of viraemia in SHIV-infected macaques for as long as antibody 
levels remained therapeutic, and immune escape was rarely observed. 
The surprising difference between hu-mice and macaques might be 
attributed in part to an intact host immune system in the macaque, 
including endogenous antibodies, or differences between SHIV and 
HIV-1 infection. Our data establish that passive infusion of single 
bNAbs can have profound effects on HIV-1 viraemia in humans. 
after treatment for 2C1, 2C5, 2D1, 2E1 and 2E2. Blue and red bars represent 
sequences obtained on days 0 and 28, respectively (Supplementary Fig. 1). 
Illustrated values represent the IC₅₀ of 3BNC117 in µg ml⁻¹ against the cloned 
HIV-1 pseudoviruses from the analysed sequences (Extended Data Table 3). 
The geometric means of the pseudoviruses’ IC₅₀ values on days 0 and 28, 
respectively, are for 2C1: 0.06 and 0.14 µg ml⁻¹; 2C5: 0.02 and 7.09 µg ml⁻¹; 
2D1: 0.15 and 0.52 µg ml⁻¹; 2E1: 0.09 and 0.23 µg ml⁻¹; 2E2: 0.01 and 
0.03 µg ml⁻¹. 
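
The geometric means quoted in this caption can be turned into per-participant fold changes in sensitivity, which makes the contrast between 2C5 and the other participants explicit. The sketch below uses only the day 0 and day 28 geometric-mean values given in the caption; the `geometric_mean` helper is included for illustration, since the individual pseudovirus IC₅₀ values are in Extended Data Table 3 and are not reproduced here.

```python
import math

def geometric_mean(values):
    """Geometric mean, the usual summary for IC50 values spanning orders of magnitude."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

# Geometric-mean IC50 (ug/ml) on day 0 and day 28, as quoted in the caption.
ic50 = {
    "2C1": (0.06, 0.14),
    "2C5": (0.02, 7.09),
    "2D1": (0.15, 0.52),
    "2E1": (0.09, 0.23),
    "2E2": (0.01, 0.03),
}

# Fold change in IC50: values above 1 mean reduced sensitivity to 3BNC117.
fold_change = {pid: day28 / day0 for pid, (day0, day28) in ic50.items()}
for pid, fc in fold_change.items():
    print(f"{pid}: {fc:.1f}-fold")
```

Only 2C5 shows a change of several hundred-fold; the other four participants change by less than fourfold, in line with the modest changes described in the text.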


Combinations of antiretroviral drugs are the standard of care for 
HIV-1 infection because resistance develops to single agents. 
Similarly, monotherapy with 3BNC117 is insufficient to control 
infection, and we expect that antibody-drug or antibody-antibody combinations 
will be required for complete viraemic control. Although the 
current generation of drugs is less expensive than antibodies, the latter 
have very long half-lives and have the potential to kill infected cells and 
to enhance host immunity by engaging Fc receptors. Moreover, anti-HIV-1 
antibodies can be made 100-fold more potent by molecular 
engineering. Finally, the combination of antibodies with agents that 
activate latent viruses can interfere with the HIV-1 reservoir in hu-mice 
and may be critical to HIV-1 eradication strategies. 

Given the difficulties in developing an HIV-1 vaccine and in erad- 
icating established infection, passive transfer of monoclonal antibodies 
is being considered for HIV-1 prevention, therapy, and cure. Our data 
establish the principle that monoclonal antibodies can be both safe and 
effective against HIV-1 in humans. Antibody-mediated immunotherapy 
differs from currently available drugs in that it has the potential to 
affect the course of HIV-1 infection by engaging host immunity directly. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Study design. An open-label, dose-escalation phase 1 study was conducted in uninfected 
(Group 1) and HIV-1-infected subjects (Group 2; http://www.clinicaltrials.gov; 
NCT02018510). Study participants were enrolled sequentially according to eligibility 
criteria. A standard “3 + 3” phase 1 trial design was used in the dose-escalation phase 
of the study. 3BNC117 was administered as a single intravenous infusion at four dose 
levels: 1 mg kg⁻¹ (subjects 1A1, 1A2, 1A3, 2A1, 2A3, 2A4), 3 mg kg⁻¹ (subjects 1B1, 
1B2, 1B3, 2B1, 2B2, 2B3), 10 mg kg⁻¹ (subjects 1C1, 1C3, 1C4, 2C2, 2C4, 2C5) or 
30 mg kg⁻¹ (subjects 1E2, 1E3, 1E5, 2D1, 2C1, 2D3, 2E1, 2E2, 2E3, 2E4, 2E5), at a rate 
of 100 or 250 ml h⁻¹. All participants provided written informed consent before 
participation in the study and the study was conducted in accordance with Good 
Clinical Practice. The protocol was approved by the Food and Drug Administration in 
the USA, the Paul Ehrlich Institute in Germany and the Institutional Review Boards at 
the Rockefeller University and the University of Cologne. 

Study participants. All study participants were recruited at the Rockefeller 
University Hospital, New York, USA and at the University Hospital Cologne, 
Cologne, Germany. Eligible subjects were adults aged 18-65 years, HIV-1-infected 
or uninfected, and without concomitant hepatitis B or C infections. HIV-1-infected 
subjects enrolled in study groups 2A through 2E were ART-experienced or naive. In 
groups 2A through 2D, subjects were either off standard ART for at least 8 weeks 
before study participation and had plasma HIV-1 RNA levels between 2,000 and 
100,000 copies ml⁻¹, or they were on standard ART but had plasma HIV-1 RNA 
levels >20 copies ml⁻¹, measured on two separate occasions at least 1 week apart. 
Subjects recruited into group 2E were HIV-1-infected, off ART (2,000-100,000 
copies ml⁻¹), and differed from the other groups in that they were pre-screened 
for sensitivity to 3BNC117 as described below. Subjects with CD4⁺ T-cell counts 
<300 cells µl⁻¹, clinically relevant deviations from normal physical findings, abnormal 
electrocardiogram (ECG), and/or laboratory examinations were excluded. 
Women of childbearing potential were required to have a negative result of a serum 
pregnancy test on the day of 3BNC117 infusion. HIV-1-infected individuals, who 
were not on standard ART at enrolment, were given the option to initiate ART 
6 weeks after 3BNC117 infusion. 

Study procedures. The appropriate volume of 3BNC117 was calculated according 
to study dose group, diluted in sterile normal saline to a total volume of 100 or 
250 ml, and administered intravenously over 60 min. Study participants received 
3BNC117 on day 0 and remained under close monitoring in the inpatient unit of 
the Rockefeller University Hospital for 24h. Participants returned for frequent 
follow up visits for safety assessments that included physical examination, mea- 
surement of clinical laboratory parameters such as haematology, chemistries, 
urinalysis, coagulation times, pregnancy tests (for women) as well as HIV-1 viral 
loads and CD4⁺ and CD8⁺ T-cell counts (Fig. 1a). Adverse events were graded 
according to the DAIDS AE Grading Table (HIV-1-infected groups) or the 
Toxicity Grading Scale for Healthy Adult and Adolescent Volunteers Enrolled 
in Preventive Vaccine Clinical Trials (uninfected groups). Blood samples (30 to 
120 ml) were collected before and at multiple times after 3BNC117 infusion. 
Samples were processed within 4h of collection, and serum and plasma samples 
were stored at -80°C. PBMCs were isolated by density gradient centrifugation. 
The absolute number of peripheral blood mononuclear cells was determined by an 
automated cell counter (Vi-Cell XR; Beckman Coulter), and cells were cryopre- 
served in fetal bovine serum plus 10% DMSO. 
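
The dose preparation described above is a weight-based calculation. The following is a minimal sketch under stated assumptions: the 20 mg ml⁻¹ stock concentration is taken from the study drug description, the body weight is hypothetical, and the function names are illustrative rather than part of any study software.

```python
STOCK_MG_PER_ML = 20.0  # supplied concentration, per the study drug description

def drug_volume_ml(dose_mg_per_kg, body_weight_kg, stock_mg_per_ml=STOCK_MG_PER_ML):
    """Volume of stock solution needed for a weight-based dose."""
    return dose_mg_per_kg * body_weight_kg / stock_mg_per_ml

def infusion_rate_ml_per_h(total_volume_ml, duration_min=60):
    """Pump rate needed to deliver the diluted volume over the infusion time."""
    return total_volume_ml * 60 / duration_min

# Hypothetical 70 kg participant in the 30 mg/kg group, diluted to 250 ml.
volume = drug_volume_ml(30, 70)
print(f"{volume:.0f} ml of stock, infused at {infusion_rate_ml_per_h(250):.0f} ml/h")
```

Delivering a total volume of 100 or 250 ml over 60 min gives the 100 or 250 ml h⁻¹ rates quoted in the study design.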

Plasma HIV-1 RNA levels. Plasma was collected for measuring HIV-1 RNA 
levels at screening (from day −49 to day −14), the day −7 pre-infusion visit (from 
day −42 to day −2), day 0 (before infusion), and on days 1, 4, 7, 14, 21, 28, 42 and 
56. HIV-1 RNA levels were determined using the Roche COBAS AmpliPrep/ 
COBAS TaqMan HIV-1 Assay, Version 2.0, which detects 20 to 10 × 10⁶ copies 
ml⁻¹, or by the ABBOTT RealTime Assay, which detects 40 to 10 × 10⁶ copies 
ml⁻¹. In samples with HIV-1 RNA <20 copies ml⁻¹, viraemia was measured by a 
quantitative real-time, reverse transcriptase (RT)-initiated PCR (RT-PCR) assay 
that can quantify HIV-1 RNA down to 1 copy ml⁻¹ as previously described. 

CD4⁺ and CD8⁺ T cells. CD4⁺ and CD8⁺ T-cell counts were determined at 
screening, on day 0 (before infusion), and day 28 by a clinical flow cytometry assay, 
performed at LabCorp or at the University Hospital Cologne. Leukocytes were 
determined as CD45⁺ cells. Percentage of cells positively stained for CD3, CD4, 
CD8 as well as the CD4/CD8 ratio were analysed with the BD Multiset software 
(BD Biosciences). 

3BNC117 study drug. 3BNC117 is a recombinant, fully human IgG1κ mAb 
recognizing the CD4 binding site on the HIV-1 envelope. The antibody was 
cloned from an HIV-1-infected viraemic controller in the International HIV 
Controller Study, expressed in Chinese hamster ovary cells (clone 5D5-5C10), 
and purified using standard methods. The 3BNC117 drug substance was 
produced at Celldex Therapeutics Fall River (MA) GMP facility, and the drug 
product was fill-finished at Gallus BioPharmaceuticals (NJ). The resulting purified 
3BNC117 was supplied as a single use sterile 20 mg ml⁻¹ solution for intravenous 
injection in 8.06 mM sodium phosphate, 1.47 mM potassium phosphate, 
136.9 mM sodium chloride, 2.68 mM potassium chloride, and 0.01% polysorbate 
80. 3BNC117 vials were shipped and stored at 4 °C. 

Measurement of 3BNC117 serum levels. Serum levels of 3BNC117 were deter- 
mined by using two separate methods (ELISA and TZM.bl). 3BNC117 serum 
concentrations were measured by a validated sandwich ELISA. Plates (Sigma-Aldrich 
PN: CLS3590 96-well, High Bind, polystyrene) were coated with 4 µg ml⁻¹ 
of an anti-idiotypic antibody specifically recognizing 3BNC117 (anti-ID 1F1 
mAb), and incubated overnight at 2-8 °C. After washing, plates were blocked 
for 1 h with 5% BSA. Serum samples, QCs and standards were added (1:50 minimum 
dilution in 5% BSA) and incubated for 1 h at room temperature. 3BNC117 
was detected using a horseradish peroxidase (HRP)-conjugated mouse anti-human 
IgG kappa-chain-specific antibody (Abcam PN: ab79115) and the HRP 
substrate tetramethylbenzidine. 3BNC117 concentrations were then interpolated 
from a standard curve of 3BNC117 using a 4 parameter logistic curve-fitting 
algorithm. The reference standard and positive controls were created from the 
drug product lot of 3BNC117 used in the clinical study. The capture anti-idiotypic 
antibody was produced by immunizing BALB/c mice with a Fab’ fragment of 
3BNC117 and plasma was tested for the presence of neutralizing antibodies in an 
ELISA. Briefly, the HIV-1 antigen (2CC core protein) was coated to a plate and 
blocked. Plasma dilutions were pre-incubated with a sub-saturating concentration 
of 3BNC117 then added to the plate. Binding to the antigen was detected with an 
HRP-conjugated goat anti-mouse IgG-Fc specific antibody. Plasma was consid- 
ered neutralizing if it was able to block the binding of 3BNC117 to the antigen 
coated plate. Two mice were selected for fusion. Those hybridomas that showed 
high specificity when comparing binding to 3BNC117 versus binding to the irrel- 
evant human IgG1 antibody were selected to expand, screened in the neutraliza- 
tion assay as described above, subcloned and purified for use in the anti-idiotype- 
specific ELISA. 
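
Interpolating a concentration from a 4-parameter logistic (4PL) standard curve amounts to inverting the fitted curve at the measured signal. The sketch below shows one common parameterization of the 4PL and its analytical inverse. The parameter values are hypothetical stand-ins for a fitted standard curve, the curve fitting itself is not shown, and the validated assay's exact parameterization is not given in the text, so this is an assumption.

```python
def four_pl(conc, bottom, top, ec50, hill):
    """Four-parameter logistic: assay signal as a function of concentration."""
    return bottom + (top - bottom) / (1 + (ec50 / conc) ** hill)

def interpolate_conc(signal, bottom, top, ec50, hill, dilution=1.0):
    """Invert the 4PL curve and correct for sample dilution."""
    if not bottom < signal < top:
        raise ValueError("signal outside the range of the standard curve")
    in_well = ec50 / ((top - bottom) / (signal - bottom) - 1) ** (1 / hill)
    return in_well * dilution

# Hypothetical fitted parameters (optical density units; ng/ml for ec50).
params = dict(bottom=0.05, top=2.5, ec50=40.0, hill=1.2)
od = four_pl(25.0, **params)
# A sample read at this OD after a 1:50 dilution maps back to 25 * 50 ng/ml.
print(round(interpolate_conc(od, dilution=50, **params), 3))
```

Signals at or beyond the curve's asymptotes cannot be inverted, which is why the sketch rejects them instead of extrapolating.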

In addition, the concentration of active 3BNC117 was determined by TZM.bl 
neutralization assay. Serum samples were heat-inactivated for 1 h at 56 °C and 
measured for neutralizing activity against an HIV-1 strain that was highly sensitive 
to 3BNC117 but resistant to any autologous HIV-1 neutralizing serum activity. In 
all uninfected subjects serum samples were tested against Q769.d22 and ID₅₀ 
values were derived by using a 5-parameter curve fit; values were considered accurate 
within the pre-established limits (threefold variation with a 20% error rate). The 
serum concentration of active 3BNC117 was calculated by multiplying the 
serum ID₅₀ titre by the known IC₅₀ of 3BNC117 for Q769.d22. In HIV-1-infected 
subjects pre-infusion samples were first tested against a panel of 
3BNC117-sensitive HIV-1 strains that included Q769.d22 (Clade A1, Tier 2), 
YU2.DG (Clade B, Tier 2), Q259.d2.17 (Clade C, Tier 1B), Q842.d12 (Clade A1, 
Tier 2), ZM135M.PL10a (Clade C, Tier 2), and TRO.11 (Clade B, Tier 2). A single 
strain per subject was selected that showed no or only minimal background 
activity and 3BNC117 serum levels were determined in the same way as described 
for Q769.d22. 
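
The conversion from a neutralization titre to an active antibody concentration described above is a single multiplication: the serum dilution that gives 50% neutralization (ID₅₀) times the antibody concentration that gives 50% neutralization of the same virus (IC₅₀). A minimal sketch, with a hypothetical titre and a hypothetical IC₅₀ for the indicator strain:

```python
def active_concentration_ug_per_ml(serum_id50_titre, ic50_ug_per_ml):
    """Active antibody concentration in undiluted serum.

    At the ID50 dilution the serum contains the antibody at its IC50,
    so the undiluted concentration is titre x IC50.
    """
    return serum_id50_titre * ic50_ug_per_ml

# Hypothetical values: ID50 titre of 1:2,000 against a strain with IC50 0.02 ug/ml.
print(round(active_concentration_ug_per_ml(2000, 0.02), 2))
```

This only holds when the serum has no neutralizing activity of its own against the indicator strain, which is why a strain resistant to autologous serum activity was chosen for each infected subject.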

Pharmacokinetic analysis. Blood samples were collected immediately before and at 
the end of the 3BNC117 infusion, 0.5, 3, 6, 9, 12 and 24 h after its completion, and on 
days 2, 4, 7, 14, 21, 28, 42 and 56. 3BNC117 serum levels were obtained from ELISA 
(Celldex Therapeutics) and TZM.bl neutralization assay, and PK-parameters 
were estimated by performing a non-compartmental analysis (NCA) using 
WinNonlin 6.3. 
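
A core quantity in non-compartmental analysis is the area under the concentration-time curve (AUC), commonly computed with the linear trapezoidal rule. The sketch below illustrates that building block only; the concentrations are hypothetical, and the specific NCA options used in WinNonlin are not stated in the text, so the choice of the linear rule here is an assumption.

```python
def auc_linear_trapezoid(times, concentrations):
    """Area under the concentration-time curve by the linear trapezoidal rule."""
    if len(times) != len(concentrations):
        raise ValueError("times and concentrations must have the same length")
    area = 0.0
    for i in range(1, len(times)):
        width = times[i] - times[i - 1]
        area += width * (concentrations[i] + concentrations[i - 1]) / 2
    return area

# Hypothetical serum levels (ug/ml) on a subset of the sampling days listed above.
days = [0, 2, 4, 7, 14, 21, 28, 42, 56]
levels = [600, 300, 220, 160, 90, 50, 30, 10, 4]
print(auc_linear_trapezoid(days, levels), "ug day / ml")
```

Clearance then follows as dose divided by AUC once the curve has been extrapolated beyond the last sampling time.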

Neutralization assay. Serum samples, viral supernatants, and control antibodies 
were tested against HIV-1 envelope pseudoviruses as previously described. 

Virus cultures. Autologous virus was retrieved from HIV-1-infected individuals 
as previously described. Briefly, healthy donor peripheral blood mononuclear 
cells (PBMCs) were obtained by leukapheresis from a single donor. Cells were 
cultured at a concentration of 5 × 10⁶ ml⁻¹ in Iscove’s modified Dulbecco’s medium 
(IMDM; Gibco) supplemented with 10% fetal bovine serum (FBS; HyClone, 
Thermo Scientific), 1% penicillin/streptomycin (Gibco), and 1 µg ml⁻¹ phytohaemagglutinin 
(Life Technologies) at 37 °C and 5% CO₂. After 2-3 days, 5 × 10⁶ cells 
were transferred into IMDM supplemented with 10% FBS, 1% penicillin/streptomycin, 
5 µg ml⁻¹ polybrene (Sigma), and 10 U ml⁻¹ of IL-2, and co-incubated 
with 2-3 × 10⁶ PBMCs from the study participants obtained before and 28 days or 
later after 3BNC117 infusion (Extended Data Table 3). Media was replaced on a 
weekly basis and culture supernatants quantified using the Alliance HIV-1 p24 
Antigen ELISA kit (PerkinElmer) according to the manufacturer’s instructions. 
Tissue culture infectious dose for 50% of cells (TCID₅₀) values were determined for 
all HIV-1-containing supernatants and then tested for sensitivity against 
3BNC117 in a TZM.bl neutralization assay. Outgrowth of autologous virus isolates 
from subjects with low level viraemia was performed as previously described. 
Briefly, 2-5 million CD4⁺ T cells were cultured in the presence of 10 million 
irradiated healthy donor PBMCs and 3 million healthy donor PHA-stimulated 
CD8⁺-depleted lymphoblasts. Lymphoblasts were replenished weekly by adding 3 
million healthy donor PHA-stimulated CD8⁺-depleted lymphoblasts. Blood samples 
and leukapheresis were collected under separate IRB-approved protocols and 
after study participants provided informed consent. 

Sequence analysis. HIV-1 RNA was extracted from plasma samples using the 
Qiagen MinElute Virus Spin kit (Qiagen) followed by first strand cDNA synthesis 
using SuperScript III reverse transcriptase (Invitrogen Life Technologies) and the 
antisense primer env3out 5'-TTGCTACTTGTGATTGCTCCATGT-3' or 5'- 
GGTGTGTAGTTCTGCCAATCAGGGAAGWAGCCTTGTG-3'. gp160 env 
was amplified using envB5out 5’-TAGAGCCCTGGAAGCATCCAGGAAG-3’ 
and envB3out 5'-TTGCTACTTGTGATTGCTCCATGT-3’ in the first round 
and second round nested primers envB5in 5’-CACCTTAGGCATCTCCTATG- 
GCAGGAAGAAG-3’ and envB3in 5’-GTCTCGAGATACTGCTCCCACCC-3’. 
First round PCR was performed using a High Fidelity Platinum Taq (Invitrogen) 
at 94 °C, 2 min; (94°C, 15s; 55 °C 30s; 68 °C, 4 min) X 35; 68 °C, 15 min. Second 
round PCR was performed with 2 µl of first-round PCR product as template and Phusion 
Hot Start Polymerase at 98 °C, 30s; (98°C, 8s; 55°C, 20s; 72°C, 1 min) X 35; 
72°C, 6 min. gp120 env was amplified using first round primers and conditions as 
described, but second round was performed by using second round nested primers 
5'-TAGAAAGAGCAGAAGACAGTGGCAATGA-3’ and 5'- 
TCATCAATGGTGGTGATGATGATGTTTTTCTCTCTGCACCACTCTTCT- 
3'. Second round PCR was performed with 1 µl of first-round PCR product as template and 
High Fidelity Platinum Taq (Invitrogen) at cycling conditions, 94°C, 2 min; 
(94°C, 15s; 58°C, 35s; 68°C, 2min and 30s) X 35; 68°C, 10 min. Following 
the second-round PCR amplification, 0.5 µl Taq polymerase was added to each 
50 µl reaction and an additional 72 °C extension for 15 min was performed to add 
3'dA overhangs for cloning inserts into pCR4-TOPO. PCR amplicons were gel- 
purified and ligated into pCR4-TOPO (Invitrogen) or pcDNA3.1 (Invitrogen), 
followed by transfection into MAX Efficiency Stbl2 Competent Cells (Life 
Technologies). Individual colonies were analysed for insert length by PCR, and 
successfully cloned envelopes sequenced by a set of env specific primers. Sequence 
alignments and mutation analysis of gp120 and gp160 was performed by using 
Geneious Pro software, version 5.6.7 (Biomatters Ltd), and residues were num- 
bered according to HXBc2. Phylogenetic trees were generated using the PhyML 
tool at the Los Alamos HIV website**, and sequence analysis was performed using 
Antibody database by Anthony West”. Logograms were generated using the 
Weblogo 3.0 tool”. Selected sequences were used to generate pseudoviruses and 
tested for 3BNC117 sensitivity in a TZM.bl assay”. 

Statistical analyses. The sample size needed to detect a >1 log10 decline in viraemia with
80% power at a 5% significance level was determined to be 5 HIV-1-infected individuals
who were not on ART and were infected with 3BNC117-sensitive viruses, assuming that the
standard deviation would be similar to that of 3BNC117 effects in hu-mice. Adverse
events were summarized by the number of subjects who experienced the event,
by severity grade and by relationship to 3BNC117 according to the DAIDS AE
Grading Table (HIV-1-infected groups) or the Toxicity Grading Scale for Healthy
Adult and Adolescent Volunteers Enrolled in Preventive Vaccine Clinical Trials
(uninfected groups). PK parameters were estimated by performing a non-compartmental
analysis (NCA) using WinNonlin 6.3. CD4+ and CD8+ T-cell counts
before and after 3BNC117 were analysed by one-way ANOVA. To assess the
changes in HIV-1 viral loads, we used a mixed-effect linear model in which dose
and time were fixed effects, with a random intercept for each participant. These
models take full advantage of the repeated-measure structure of the data while
estimating parameters for each dose simultaneously, hence improving the power
of small study groups. The final model was fitted assuming an AR(1) correlation
structure over time, which was the best in terms of AIC/BIC criteria. The significance
of the effect of 3BNC117 on viral load, defined as the change between each time
point and day 0, was assessed using least-squares means within each dose group
(Extended Data Table 2b). Sensitivity analysis was also carried out with variations
of these models and the same conclusions were reached. Pearson and nonparametric
Spearman coefficients were calculated to assess the correlation between
maximum drop in viraemia after 3BNC117 infusion and baseline HIV-1 viral load
or baseline sensitivity of autologous viruses to 3BNC117.
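
As an illustration of the two correlation coefficients named above, the following is a minimal sketch in plain Python, not the authors' analysis code. The function names and the four data points are hypothetical and chosen only to show how a monotonic but non-linear relationship gives a Spearman coefficient of 1 while the Pearson coefficient stays below 1.

```python
import math


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rho: the Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values (not study data): baseline viral load and maximum drop,
# both in log10 copies/ml.
baseline_log10_vl = [3.0, 3.5, 4.0, 4.5]
max_drop_log10 = [0.1, 0.4, 0.9, 1.6]

print(f"Pearson r    = {pearson(baseline_log10_vl, max_drop_log10):.3f}")
print(f"Spearman rho = {spearman(baseline_log10_vl, max_drop_log10):.3f}")
```

The P values reported with each coefficient would come from a separate significance test, which this sketch does not attempt.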
a   3BNC117

Clade        Number of tested strains   Mean IC50*   Coverage (%) (IC50 < 10 µg/ml)
A            57                         0.04         86.0
AE           22                         0.08         90.9
B            49                         0.08         91.8
C            84                         0.14         71.4
D            17                         0.11         88.2
G            7                          0.16         71.4
All clades   237**                      0.08         82.3

Data retrieved from AntibodyDatabase (West et al., PNAS, 2013)
* Geometric mean
** Includes 1 strain for which no clade was determined

b   [Plot: % coverage (y axis) versus IC50 (x axis)]

Extended Data Figure 1 | HIV-1 neutralizing activity of 3BNC117.
a, Summary of 3BNC117 neutralizing in vitro activity based on 237 HIV-1
isolates comprising 6 different clades. Data were retrieved from the
'AntibodyDatabase' by A.P.W. (ref. 39). b, Illustration of the fraction (that is, %
coverage; y axis) of HIV-1 isolates that are neutralized at a given IC50 (µg ml⁻¹;
x axis) using the same data set.
Extended Data Figure 2 | CD4+ and CD8+ T-cell counts before and after
3BNC117 infusion. a, Absolute numbers (cells µl⁻¹) of CD4+ and CD8+
T cells of all enrolled HIV-1-infected participants at screen, on the day of
3BNC117 infusion (day 0), and at day 28 after infusion. b, Percentage of CD4+
and CD8+ T cells for the same subjects and time points. Mean and standard
deviation are indicated in red and black, respectively. No significant differences
between pre-infusion and post-infusion levels were detected by using one-way
ANOVA.
Extended Data Figure 3 | 3BNC117 serum concentration and activity in
single subjects. a, b, Serum levels of 3BNC117 in all uninfected (a) and
HIV-1-infected (b) individuals that received 1, 3, 10, or 30 mg kg⁻¹ 3BNC117 at day 0.
Antibody levels were measured by a sandwich ELISA using an anti-3BNC117
specific antibody (green) or by measuring the 3BNC117 serum activity in a
TZM.bl neutralization assay (blue).
Extended Data Figure 4 | 3BNC117 sensitivity and changes in viraemia in 2
ART-treated subjects. Both subjects were on ART when enrolled in the study
and received a single dose (2B2, 3 mg kg⁻¹; 2C2, 10 mg kg⁻¹) of 3BNC117 at
day 0. The left y axis shows log10 change in viraemia from baseline, and the right y
axis shows antibody level measured in ELISA. Blue line reflects change in VL
and dotted grey line antibody level. Numbers indicate IC50 values for 3BNC117
of autologous viral isolates measured by TZM.bl assay, colour-coded as
indicated on the right.
Extended Data Figure 5 | Correlating viral decay with 3BNC117 sensitivity
and starting viral load. a, Maximum decline in viral load in ART-untreated
HIV-1-infected participants with baseline 3BNC117-sensitive viruses
(IC50 < 1 µg ml⁻¹) versus pre-treatment (day 0) viral load (Pearson coefficient
r = 0.72, P = 0.03; Spearman coefficient rho = 0.78, P = 0.02). b, Maximum
drop in viral load in HIV-1-infected and viraemic individuals receiving a 10 or
30 mg kg⁻¹ dose of 3BNC117 (y axis) versus baseline autologous virus
sensitivity to 3BNC117 (x axis; Pearson coefficient r = 0.69, P = 0.03; Spearman
coefficient rho = 0.41, P = 0.23).
Extended Data Table 1 | Baseline characteristics of HIV-1-infected individuals and 3BNC117 safety data

a

Study ID   3BNC117 dose   Current ART regimen   Clade   HIV-RNA level (copies/ml)   Abs. CD4+ T-cell count (day 0; cells/mm³)
2A1        1 mg/kg        ART naive             B       3,210                       674
2A3        1 mg/kg        Off ART               B       43,650                      520
2A4        1 mg/kg        ART naive             B       5,340                       607
2B1        3 mg/kg        Off ART               ND      4,090                       264*
2B2        3 mg/kg        DRV/r/TDF/FTC**       ND      100                         706
2B3        3 mg/kg        ART naive             B       38,190                      777
2C2        10 mg/kg       ATV/r/3TC/ZDV**       ND      30                          728
2C4        10 mg/kg       Off ART               ND      820                         805
2C5        10 mg/kg       ART naive             B       9,260                       245*
2D1        30 mg/kg       ART naive             B       53,470                      980
2C1        30 mg/kg       Off ART               B       47,650                      1129
2D3        30 mg/kg       ART naive             ND      640                         618
2E1        30 mg/kg       ART naive             B       15,780                      847
2E2        30 mg/kg       ART naive             B       6,990                       513
2E3        30 mg/kg       ART naive             BF      22,030                      590
2E4        30 mg/kg       ART naive             ND      32,220                      603
2E5        30 mg/kg       ART naive             ND      3,610                       532

[Columns 'Age' and 'Years since HIV diagnosis' are illegible in the source.]

b

                                 Uninfected (No. of AEs)          HIV-1-infected (No. of AEs)
Adverse event                    1       3       10      30       3       10      30       (mg/kg)
Rhinorrhea and/or cough          1       1       0       1        0       0       6
Malaise                          1       1       2       0        2       0       0
Headache                         0       0       0       1        0       1       3
Diarrhea                         1       1       0       0        0       1       1
Myalgia/arthralgia (localized)   0       0       0       0        0       3       0
Sore throat                      1       1       0       0        1       1       0
Tenderness                       0       0       0       1        0       1       0
Increased lacrimation            0       0       0       1        0       0       1
Myalgia                          0       0       0       0        0       1       1
Chills                           0       1       0       0        0       0       0
Conjunctival erythema            0       0       1       0        0       0       0
Feverishness                     0       0       0       0        1       0       0
Nausea                           0       0       0       1        0       0       0
Pruritus                         0       0       0       0        0       0       2
Blurry vision                    0       0       0       0        0       1       0
Decreased appetite               0       0       0       0        0       0       0
Erythema                         0       0       0       1        0       0       0
Paresthesia upper extremity      0       0       0       0        0       0       0
Shingles                         0       0       0       0        0       0       1
Vomiting                         0       0       0       0        0       0       0

[Columns 'No. of AEs', '% of AEs', 'No. reported possibly related', 'No. mild/moderate/severe' and the HIV-1-infected 1 mg/kg counts are illegible in the source.]

a, Baseline characteristics of HIV-1-infected individuals. *Absolute CD4 T-cell count was 309 and 302 cells µl⁻¹ at screening. **DRV/r/TDF/FTC, darunavir, ritonavir, tenofovir, emtricitabine; ATV/r/3TC/ZDV,
atazanavir, ritonavir, lamivudine, zidovudine. ND, not determined. b, 3BNC117 safety data. AE, adverse events. Subject 2D3 developed herpes zoster involving a lumbar dermatome 35 days after infusion. The
event was graded as moderate and considered not related to 3BNC117.
Extended Data Table 2 | HIV-1 RNA levels and viral decay mixed-effect linear model

a   HIV-1 RNA level (c/ml and log10 c/ml)

1 mg/kg
             2A1                2A3                2A4
Timepoint    c/ml      log10    c/ml      log10    c/ml      log10
Screen       1,340     3.13     41,670    4.62     5,350     3.73
Pre          1,100     3.04     48,100    4.68     6,490     3.81
Day 0        3,210     3.51     43,560    4.64     5,340     3.73
Day 1        10,810    4.03     50,180    4.70     10,880    4.04
Day 4        4,970     3.70     24,050    4.38     8,430     3.93
Day 7        3,190     3.50     34,850    4.54     4,870     3.69
Day 14       2,600     3.41     31,770    4.50     11,000    4.04
Day 21       3,820     3.58     35,840    4.55     7,470     3.87
Day 28       5,070     3.71     38,500    4.59     7,880     3.90
Day 42       3,570     3.55     59,780    4.78     12,150    4.08
Day 56       6,370     3.80     63,600    4.80     14,020    4.15

3 mg/kg
             2B1                2B2
Timepoint    c/ml      log10    c/ml      log10
Screen       6,420     3.81     80*       1.90
Pre          4,950     3.69     30        1.48
Day 0        4,090     3.61     100       2.00
Day 1        6,470     3.81     140       2.15
Day 4        2,690     3.43     190       2.28
Day 7        7,610     3.27     20        1.30
Day 14       4,880     3.69     130       2.11
Day 21       4,160     3.62     90        1.95
Day 28       3,800     3.58     70        1.85
Day 42       4,970     3.70     90        1.95
Day 56       3,550     3.55     30        1.48

10 mg/kg
             2C2                2C4
Timepoint    c/ml      log10    c/ml      log10
Screen       20        1.30     2,510     3.41
Pre                    1.48     1,100     3.04
Day 0                  1.48     820       2.91
Day 1                  1.78     690       2.84
Day 4                  1.70     980       2.99
Day 7                  0.96     990       3.00
Day 14                 0.79     700       2.85
Day 21                 1.30     1,420     3.15
Day 28                 0.23     630       2.80
Day 42                 0.45     980       2.99
Day 56                 1.32     940       2.97

30 mg/kg
             2D1                2C1                2E1      2E2                2E4
Timepoint    c/ml      log10    c/ml      log10    log10    c/ml      log10    c/ml      log10
Screen       5,730     3.76     31,870    4.50     3.94     3,394     3.53     45,311    4.66
Pre          34,840    4.54     31,870    4.50     4.08     1,802     3.26     66,889    4.83
Day 0        53,470    4.73     47,650    4.68     4.20     6,990     3.84     32,220    4.51
Day 1        84,450    4.93     46,040    4.66     4.47     6,450     3.81     44,980    4.65
Day 4        36,140    4.56     7,790     3.89     3.55     2,340     3.37     19,290    4.29
Day 7        5,980     3.78     7,470     3.87     2.61     1,663     3.22     3,642     3.56
Day 14       4,830     3.68     19,360    4.29     2.67     253       2.40     983       2.99
Day 21       15,180    4.18     30,380    4.48     3.93     917       2.96     5,726     3.76
Day 28       10,960    4.04     25,610    4.41     3.91     4,273     3.63     18,005    4.26
Day 42       7,650     3.88     22,830    4.36     3.82     15,721    4.20     40,079    4.60
Day 56       9,820     3.99     32,310    4.51     4.13     3,465     3.54     22,100    4.34

[Values for subjects 2B3, 2C5, 2D3, 2E3 and 2E5, and the Δ log10 columns, are illegible in the source.]

b   Average by dose group

Contrast          1 mg/kg   3 mg/kg   10 mg/kg   30 mg/kg
Screen - Day 0    -0.133    0.172     0.415      -0.084
Day -7 - Day 0    -0.112    -0.020    0.098      0.022
Day 1 - Day 0     0.299     0.061     -0.159     0.065
Day 4 - Day 0     0.043     -0.351    -0.260     -0.458
Day 7 - Day 0     -0.047    -0.338    -0.636     -1.043
Day 14 - Day 0    0.028     0.024     -0.503     -1.181
Day 21 - Day 0    0.046     0.022     0.111      -0.866
Day 28 - Day 0    0.105     -0.063    -0.075     -0.515
Day 42 - Day 0    0.180     -0.017    0.102      -0.339
Day 56 - Day 0    0.294     -0.100    0.101      -0.371

a, HIV-1 RNA levels. Subjects 2B2 and 2C2 were on ART. Subject 2D3 started ART after day 42. Screen was performed between day −49 and day −14. Viraemia measurements at 'Pre' were performed between
day −42 and day −2. b, Viral decay mixed-effect linear model.
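
The log10 values and the change from day 0 used throughout this table follow from simple arithmetic on the copies/ml readings. The sketch below works through it for subject 2A1 using four of the readings listed in panel a; the function and variable names are illustrative, not taken from the study.

```python
import math

# Subject 2A1, HIV-1 RNA in copies/ml (from Extended Data Table 2a).
viral_load = {"Day 0": 3210, "Day 1": 10810, "Day 7": 3190, "Day 14": 2600}


def log10_change(copies_now, copies_baseline):
    """Change in log10 viral load relative to baseline (negative = decline)."""
    return math.log10(copies_now) - math.log10(copies_baseline)


baseline = viral_load["Day 0"]
for timepoint, copies in viral_load.items():
    delta = log10_change(copies, baseline)
    print(f"{timepoint}: log10 = {math.log10(copies):.2f}, delta = {delta:+.2f}")
```

A decline of one log10 unit corresponds to a tenfold drop in copies/ml, which is the effect size the sample-size calculation in the Methods was designed to detect.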
Extended Data Table 3 | Sensitivity of autologous virus isolates and cloned HIV-1 envelopes to 3BNC117

Autologous virus isolates

Dose       ID    Day post infusion   3BNC117 IC50 (µg/ml)   3BNC117 IC50 (fold change)
1 mg/kg    2A1   Day 0               ND
                 Day 28              0.90                   ND
           2A3   Day 0               0.11
                 Day 28              3.78                   [illegible]
           2A4   Day 0               0.07
                 Day 28              0.94                   13.5
3 mg/kg    2B1   Day 0               0.77
                 Day 28              520                    25.9
           2B2   Day 0               >20
                 Day 28              ND                     ND
           2B3   Day 0               0.20
                 Day 28              0.30                   1.5
10 mg/kg   2C2   Day 0               0.49
                 Day 56              0.03                   0.1
           2C4   Day 0               >20
                 Day 28              20                     No change
           2C5   Day 0               0.09
                 Day 28              15.36                  [illegible]
30 mg/kg   2C1   Day -7              0.54
                 Day 28              1.78                   18
           2D1   Day 0               0.68
                 Day 28              [illegible]            1.3
           2D3   Day 0               0.13
                 Day 28              0.35                   2.7
           2E1   Screen              0.40
                 Day 28              ND                     ND
           2E2   Screen              0.18
                 Day 28              2.23                   12.3
           2E3   Screen              0.18
                 Day 28              1.10                   6.1
           2E4   Screen              0.24
                 Day 28              ND                     ND
           2E5   Screen              0.30
                 Day 28              ND                     ND

HIV-1 envelopes cloned from plasma (ND for all subjects not listed)

ID    Day      Clone         Cloning procedure   Vector backbone   3BNC117 IC50 (µg/ml)   Average (geo. mean)
2C5   Day 0    2C5_D0_12     gp120               pSVIII            0.015
               2C5_D0_21     gp120               pSVIII            0.017                  0.02
               2C5_D0_27     gp120               pSVIII            0.017
      Day 28   2C5_W4_59     gp120               pSVIII            11.543
               2C5_W4_22     gp120               pSVIII            6.737
               2C5_W4_27     gp120               pSVIII            7.514                  7.09
               2C5_W4_28     gp120               pSVIII            3.495
               2C5_W4_34     gp120               pSVIII            8.758
2C1   Day -7   2C1_D0_12     gp120               pSVIII            0.209
               2C1_D0_22     gp120               pSVIII            0.158                  0.06
               2C1_D0_32     gp120               pSVIII            0.006
      Day 28   2C1_W4_12     gp120               pSVIII            0.043
               2C1_W4_18     gp120               pSVIII            0.225                  0.14
               2C1_W4_31     gp120               pSVIII            0.262
2D1   Day 0    2D1_D0_D5     gp160               pcDNA3.1          0.165
               2D1_D0_B3.1   gp160               pcDNA3.1          0.128                  0.15
               2D1_D0_B10    gp160               pcDNA3.1          0.172
      Day 28   2D1_W4_37     gp120               pSVIII            0.578
               2D1_W4_40     gp120               pSVIII            0.501
               2D1_W4_69     gp120               pSVIII            0.465                  [illegible]
               2D1_W4_71     gp120               pSVIII            0.523
2E1   Screen   2E1_D0_12     gp160               pcDNA3.1          0.103
               2E1_D0_20     gp160               pcDNA3.1          0.115                  0.09
               2E1_D0_34     gp160               pcDNA3.1          0.068
      Day 28   2E1_W4_23     gp160               pcDNA3.1          0.041
               2E1_W4_E1     gp160               pcDNA3.1          0.590                  0.23
               2E1_W4_F6     gp160               pcDNA3.1          0.496
2E2   Screen   2E2_D0_A10    gp160               pcDNA3.1          0.017
               2E2_D0_C3     gp160               pcDNA3.1          0.010                  0.01
               2E2_D0_E9     gp160               pcDNA3.1          0.018
      Day 28   2E2_W4_B9     gp160               pcDNA3.1          0.017
               2E2_W4_C11    gp160               pcDNA3.1          0.020                  0.03
               2E2_W4_D5     gp160               pcDNA3.1          0.057

ND, not determined.
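
The 'Average (geo. mean)' column can be checked directly from the per-clone IC50 values. The sketch below is a minimal illustration, with a hypothetical helper name, that recomputes the geometric mean for the 2C5 clones listed in the table.

```python
import math


def geometric_mean(values):
    """Geometric mean, computed as exp of the mean of the natural logs."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return math.exp(sum(math.log(v) for v in values) / len(values))


# 3BNC117 IC50 values (ug/ml) of cloned envelopes from subject 2C5.
clones_2c5_day0 = [0.015, 0.017, 0.017]
clones_2c5_week4 = [11.543, 6.737, 7.514, 3.495, 8.758]

print(f"2C5 day 0:  {geometric_mean(clones_2c5_day0):.2f} ug/ml")
print(f"2C5 week 4: {geometric_mean(clones_2c5_week4):.2f} ug/ml")
```

The geometric mean is the natural summary for IC50 values because they span several orders of magnitude and are compared as ratios, so averaging on the log scale keeps a single insensitive clone from dominating.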
Extended Data Table 4 | Pharmacokinetics of 3BNC117 based on a 56-day period post infusion


ID  HIV-1 status/ART  3BNC117 dose (mg/kg)  Assay  Cmax adjusted (µg/ml)  R-squared  Estimated T1/2 (days)  Lambda (lower)*  Lambda (upper)*  AUC (INF_pred)  Tlast  Clast (µg/ml)  AUC_%Extrap_pred
fn Glens F ELISA 274 nd. nd. i - nd. : : nd. 
g TZM.b 274 0.936 21.32 14 42 184.0 42 12 19.2 
ELISA 18.8 nd. nd. : - nd. - nd. 
1A2 Negative | TZM.b 278 0.893 12.89 7 42 181.7 42 1.0 9.0 
ELISA 11.2 nd. nd. - - nd. - - nd. 
Jag. <Negalive : TZM.b 23.5 0.976 10.38 4 28 138.0 28 13 14.0 
2A1 PositivelOff 1 ELISA 15.7 nd. nd. : nd. : 2 nd. 
2A3 PositivelOff 1 ELISA 22.7 nd. nd. : : nd. : . nd. 
7 ELISA 33.8 nd. nd. - : nd. - - nd. 
eed Posten TZM.b 15.6 nd. nd. - : nd. : - nd. 
ck aie ; ELISA 89.9 0.985 19.64 7 56 685.2 56 3 12.1 
g TZM.b 70.9 0.881 24.30 7 56 483.7 56 27 179 
tea. enie ‘ ELISA 90.1 0.878 12.91 7 42 523.5 42 22 8.8 
g TZM.b 116.0 0.936 10.04 7 42 481.2 42 14 46 
se. “niedetie . ELISA 243.4 0.954 21.54 14 56 1017.3 56 42 12.4 
g TZM.b 74.5 0.954 21.70 7 56 706.6 56 3.9 15.6 
_ ELISA 90.8 0.942 9.60 4 21 219.4 21 28 19.2 
281 FosiiverOn  -s TZM.b 57.4 0.994 10.28 7 28 245.9 28 19 12.1 
2B2 Positive(On 3 ELISA 97.6 0.928 8.74 7 21 200.7 21 25 16.5 
- ELISA 60.0 0.981 9.08 4 21 251.2 21 35 17.0 
be Posie TZM.b 32.6 0.994 9.19 4 28 129.8 28 1.0 10.1 
ici. aces om ELISA 347.8 0.942 16.33 7 56 1667.9 56 59 75 
g TZM.b 281.8 0.964 15.79 7 56 1598.9 56 56 73 
fee Nees ‘6 ELISA 308.9 0.860 29.01 7 56 2529.4 56 15.8 23.5 
g TZM.b 244.7 0.700 26.59 14 56 2228.5 56 13.1 193 
Wick teicoa ee ELISA 180.6 0.835 10.73 7 42 859.1 42 4.0 55 
g TZM.b 230.7 0.990 10.56 14 56 1022.0 56 14 22 
2C2 PositivefOn 10 ELISA 326.8 0.866 10.87 7 42 904.5 42 27 56 
, ELISA 178.2 0.944 6.92 4 28 550.2 28 28 5.1 
24. Foster 10 TZM.b 193.0 0.979 6.14 4 28 417.9 28 18 34 
2 ELISA 201.1 0.997 6.60 7 28 592.6 28 26 44 
eae, Fostverot 10 TZM.b 174.4 0.994 6.50 7 28 431.2 28 18 37 
ee eas - ELISA 360.8 0.930 14.48 7 56 4259.4 56 13.6 59 
gativ TZM.b 1166.3 0.978 16.19 14 56 5901.2 56 16.1 62 
s,s “ ELISA 361.8 0.990 16.26 7 56 3177.5 56 11.7 8.2 
g TZM.b 606.2 0.989 17.85 7 56 3422.2 56 123 93 
jee “ase ELISA 765.0 0.992 14.15 4 56 5874.2 56 16.3 57 
g TZM.b 939.6 0.959 15.22 14 56 6424.5 56 19.1 61 
7 ELISA 410.2 0.991 5.80 7 28 1707.8 28 51 26 
201 FOstNneIOn. «720 TZM.b 717.4 0.993 5.99 7 42 2186.4 42 16 06 
. ELISA 976.4 0.996 6.86 7 42 2494.9 42 3.4 13 
2b1 (Positive. 30 TZM.b 849.3 0.990 8.83 7 56 1825.5 56 14 08 
: ELISA 571.0 0.962 13.39 7 56 3616.5 56 9.9 48 
eDexPosiiverp, 20 TZM.b 953.7 0.993 11.23 7 42 4346.2 42 16.0 6.0 
2E1 PositiveOff 30 ELISA 712.4 0.970 11.44 14 56 3028.5 56 46 23 
2E2 PositivelOff 30 ELISA 789.4 0.920 11.14 7 56 3596.1 56 67 24 
2E3 PositivelOff 30 ELISA 559.6 0.821 8.54 7 28 2495.6 28 13.7 84 


Estimation of PK parameters retrieved from study participants (ID) up to 8 weeks post 3BNC117 infusion. 3BNC117 was administered at 1, 3, 10, and 30 mg kg⁻¹ in uninfected (Negative) and HIV-1-infected
(Positive) individuals. 3BNC117 serum concentrations were determined by ELISA or TZM.bl assay (see Methods). PK parameters were obtained by performing a non-compartmental analysis (NCA) using
WinNonlin 6.3. T1/2 was estimated using values between Lambda upper/lower. n.d., PK parameters were not determined because of insufficient data or an extrapolated AUC_inf that exceeded 25%. Values in red
indicate extrapolated AUC_inf >10%. *Days post infusion of 3BNC117.
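
For orientation, the half-life and extrapolated-AUC columns can be related through the textbook non-compartmental formulas: the terminal rate constant is the negative slope of ln(concentration) against time over the chosen window, T1/2 = ln 2 / lambda_z, and AUC_inf = AUC_last + C_last / lambda_z. The sketch below assumes an unweighted least-squares fit; the settings actually used in WinNonlin are not stated in the source, and the time points and concentrations are hypothetical.

```python
import math


def terminal_half_life(times, concentrations):
    """Fit ln(C) = a - lambda_z * t by least squares; return (lambda_z, t_half)."""
    logs = [math.log(c) for c in concentrations]
    n = len(times)
    mean_t = sum(times) / n
    mean_log = sum(logs) / n
    slope = (
        sum((t - mean_t) * (lc - mean_log) for t, lc in zip(times, logs))
        / sum((t - mean_t) ** 2 for t in times)
    )
    lambda_z = -slope
    return lambda_z, math.log(2) / lambda_z


def percent_auc_extrapolated(auc_last, c_last, lambda_z):
    """Share of AUC_inf contributed by the extrapolated tail, in percent."""
    auc_inf = auc_last + c_last / lambda_z
    return 100 * (auc_inf - auc_last) / auc_inf


# Hypothetical terminal-phase samples: days post infusion and serum level (ug/ml),
# generated from an exact 14-day half-life so the fit can be checked by eye.
days = [7, 14, 28, 42, 56]
serum = [100 * 0.5 ** (d / 14) for d in days]

lambda_z, t_half = terminal_half_life(days, serum)
print(f"lambda_z = {lambda_z:.4f} per day, T1/2 = {t_half:.1f} days")
print(f"extrapolated = {percent_auc_extrapolated(900.0, 5.0, 0.05):.1f}% of AUC_inf")
```

A large extrapolated share means most of the exposure estimate rests on the fitted tail instead of measured samples, which is why the table flags values above 10% and omits parameters above 25%.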
Extended Data Table 5 | Summary of 3BNC117 pharmacokinetics based on a 56-day period post infusion

                                                            Cmax (µg/ml)                       t1/2 (days) (1)
Dose    HIV-1 status   Subjects   Method (subjects analyzed)   Mean    SD      Range             Mean   SD            Range
1 mg    Neg.           3          ELISA (3)                    19.1    8.1     11.2 - 27.4       n.d.   n.d.          n.d.
                                  TZM.bl (3)                   26.2    2.4     23.5 - 27.8       n.d.   n.d.          n.d.
1 mg    Pos.           3          ELISA (3)                    24.1    9.1     15.7 - 33.8       n.d.   n.d.          n.d.
                                  TZM.bl (1)                   23.5    n.d.    n.d.              n.d.   n.d.          n.d.
3 mg    Neg.           3          ELISA (3)                    141.1   88.6    89.9 - 243.4      18.0   4.5           12.9 - 21.5
                                  TZM.bl (3)                   87.1    25.1    70.9 - 116.0      18.7   7.6           10.0 - 24.3
3 mg    Pos.           3          ELISA (3)                    73.7    35.7    32.6 - 97.6       9.2    0.4           8.7 - 9.6
                                  TZM.bl (3)                   50.6    15.7    32.6 - 61.7       11.2   2.2           9.5 - 13.73
10 mg   Neg.           3          ELISA (3)                    279.1   87.5    180.6 - 347.8     18.7   9.4           10.7 - 29.0
                                  TZM.bl (3)                   252.4   26.4    230.7 - 281.8     17.7   8.1           10.7 - 26.6
10 mg   Pos.           3          ELISA (3)                    235.4   80.0    178.2 - 326.8     8.1    2.4           6.6 - 10.9
                                  TZM.bl (3)                   213.6   52.6    174.4 - 273.4     8.8    4.3           6.5 - 13.7
30 mg   Neg.           3          ELISA (3)                    495.9   233.1   360.8 - 765.0     15.0   1.4           14.2 - 16.3
                                  TZM.bl (3)                   904.0   281.7   606.2 - 1166.3    16.4   [illegible]   15.2 - 17.9
30 mg   Pos.           6          ELISA (6)                    669.8   199.7   410.2 - 976.4     9.9    3.3           5.8 - 13.7
                                  TZM.bl (3)                   840.1   118.4   717.4 - 953.7     8.9    2.9           6.7 - 11.2
All     Neg.           12         ELISA (9)                    -       -       -                 17.2   5.5           10.7 - 29.0
                                  TZM.bl (9)                   -       -       -                 17.6   5.7           10.0 - 26.6
All     Pos.           15         ELISA (12)                   -       -       -                 9.3    2.6           5.7 - 13.7
                                  TZM.bl (9)                   -       -       -                 9.6    2.9           6.1 - 13.7


(1) Estimation of half-lives; SD, standard deviation.
Cell death during crisis is mediated by mitotic telomere deprotection


Makoto T. Hayashi, Anthony J. Cesare, Teresa Rivera & Jan Karlseder


Tumour formation is blocked by two barriers: replicative senescence
and crisis. Senescence is triggered by short telomeres
and is bypassed by disruption of tumour-suppressive pathways.
After senescence bypass, cells undergo crisis, during which almost
all of the cells in the population die. Cells that escape crisis harbour
unstable genomes and other parameters of transformation. The
mechanism of cell death during crisis remains unexplained. Here
we show that human cells in crisis undergo spontaneous mitotic
arrest, resulting in death during mitosis or in the following cell
cycle. This phenotype is induced by loss of p53 function, and is
suppressed by telomerase overexpression. Telomere fusions triggered
mitotic arrest in p53-compromised non-crisis cells, indicating
that such fusions are the underlying cause of cell death.
Exacerbation of mitotic telomere deprotection by partial TRF2
(also known as TERF2) knockdown increased the ratio of cells
that died during mitotic arrest and sensitized cancer cells to mitotic
poisons. We propose a crisis pathway wherein chromosome fusions
induce mitotic arrest, resulting in mitotic telomere deprotection
and cell death, thereby eliminating precancerous cells from the
population.

Replicative senescence is induced by partially deprotected telomeres,
which activate a DNA damage response (DDR) without telomere
fusions. Crisis requires the bypass of senescence through loss of
checkpoints and causes massive cell death concomitant with further
telomere shortening and spontaneous telomere fusions. However,
the mechanism of cell death was not understood. Mitotic arrest is
Figure 1 | Spontaneous mitotic arrest upon bypass of senescence.
a, Representative movie frames of normal mitosis. Arrows, nuclear envelope
breakdown (NEBD) and cytokinesis. Scale bar, 10 µm. b, Mitotic duration of
individual indicated cells (upper panel). Mean mitotic duration ± s.e.m. (lower
panel) (Mann-Whitney test, compared to IMR-90 E6E7 PD36). c, Growth
curves of indicated IMR-90 derivatives. Vec., vector. d, Mitotic duration of
indicated derivatives as in b. Results were reproduced independently at least
twice. e, f, Mitotic duration of IMR-90 E6E7 (e) or p53dd (f) cells expressing
hTert as in b. Cells were infected at the indicated PD. *P < 0.05, **P < 0.005,
***P < 0.0001. Mann-Whitney test.
associated with spindle assembly checkpoint (SAC)-independent telomere
deprotection, and we therefore set out to test whether prolonged
mitosis has a role in crisis.

We monitored mitotic duration using live-cell imaging. Mitosis in
primary IMR-90 fibroblasts lasted <45 min. However, IMR-90 fibroblasts
expressing HPV16 E6 and E7, which inhibit p53 and Rb, displayed
variable mitotic duration upon senescence bypass (Fig. 1a, b).
Prolonged mitosis, defined as mitosis of >2 h, became prominent in
pre-crisis cells (Extended Data Fig. 1a). Spontaneous mitotic arrest also
occurred in pre-crisis cells following expression of E6 or dominant-negative
p53dd (Extended Data Fig. 1b), indicating that loss of p53
function was required (Fig. 1c, d and Extended Data Fig. 1c, d).

Overexpressing hTERT prevented senescence in IMR-90 cells
(Fig. 1c, d and Extended Data Fig. 1c, d). Telomere elongation in
Figure 2 | Telomere fusions induce mitotic arrest. a, Mean percentage of 
telomeric fusions per cell + s.e.m. in IMR-90 E6E7 and derivatives 7 days post 
infection around PD45. Representative metaphases below. Scale bar, 10 jm. 
Enlarged images are 4X magnified. b, Mean mitotic duration + s.e.m. in the 
cell lines from a. Mitotic duration data of growing IMR-90 E6E7 cells are the 
same as Fig. 1b. c, d, Percentage of telomeric fusion (c) and mitotic duration 
IMR-90 E6E7 or p53dd cells also suppressed mitotic arrest (Fig. 1e, f
and Extended Data Fig. 2a-c), confirming telomere shortening
as the cause. Reversine inhibition of MPS1 (ref. 7) suppressed mitotic
arrest (Extended Data Fig. 1e), indicating dependence on the SAC.
Hesperadin, an inhibitor of Aurora B kinase, which is required for activation of
the SAC upon tensionless kinetochore-microtubule attachment, also suppressed
mitotic arrest (Extended Data Fig. 1e), suggesting abnormal
kinetochore-microtubule attachment.

To determine if telomere fusion causes mitotic arrest, we used two
independent guide RNAs (sgTRF2-1 and sgTRF2-2), which efficiently
reduced TRF2 expression and induced telomere fusions in young
IMR-90 E6E7 cells (Fig. 2a and Extended Data Fig. 2d, e). These guide
RNAs also led to mitotic arrest, comparable to IMR-90 E6E7 cells
around population doubling (PD) 108 (Fig. 2b and Extended Data
(d) in indicated IMR-90 E6E7 cells (mean ± s.e.m.). e-g, Percentage of
telomeric fusion (e), mitotic duration (f) and mean number of meta-TIF
(g) in indicated cells (mean ± s.e.m.). h, Scatter plots with bars showing
the mean percentage of cells with prolonged mitosis (two independent
experiments). *P < 0.05, **P < 0.005, ***P < 0.0001. NS, not significant.
Mann-Whitney test.
Fig. 2f). Suppression of both telomere fusion and mitotic arrest by a
resistant TRF2 excludes off-target effects (Fig. 2c, d and
Extended Data Fig. 3a-c).

To address whether telomeric DDR or telomere fusion induces
mitotic arrest, we deleted TRF2 in young IMR-90 E6E7 cells lacking
53BP1 or ligase IV (Extended Data Fig. 3d, e). Suppression of
53BP1 or ligase IV strongly reduced fusion frequency (Fig. 2e) and
prevented mitotic arrest (Fig. 2f, h), but did not reduce the number of
deprotected telomeres (Fig. 2g and Extended Data Fig. 3f), thereby
separating mitotic delay from DDR. Both telomere fusion and mitotic
arrest phenotypes were suppressed by ATM inhibition (Extended
Data Fig. 3g-j), again indicating that telomere fusion underlies mitotic
arrest. ATM inhibition did not suppress mitotic arrest induced by
Taxol (Extended Data Fig. 3k, l), confirming that the inhibitor
does not perturb the SAC. Additionally, cells expressing a small
hairpin RNA targeting TRF2 (shTRF2-F), which causes telomere
deprotection in the absence of fusion, did not undergo arrest
(Fig. 2a, b and Extended Data Fig. 2e, f). These data are consistent
with the observation that senescent cells, while harbouring a number
of unfused deprotected telomeres, do not display mitotic arrest
(Fig. 1b and Extended Data Fig. 1a).

Deletion of TRF2 increased anaphase bridge frequency and pericen- 
trin foci (Extended Data Fig. 4a, b) 7 days post infection, indicating 
multipolar mitosis when cells display telomere fusions and mitotic 
arrest. Accordingly, sgTRF2-2 cells exhibit unaligned metaphase chro- 
mosomes (Extended Data Fig. 4c), suggesting a chromosome congres- 
sion defect. Tetraploidy did not increase as dramatically as the 
pericentrin foci (Extended Data Fig. 4d), ruling out tetraploidization 
and centrosome amplification due to anaphase bridges and cytokinesis 
failure, as the cause of multipolarity. We conclude that multipolarity 
contributes to the mitotic arrest phenotype, although it is not clear how 
telomere fusions drive centrosome abnormality. 

Mitotic arrest in young IMR-90 cells induces mitotic telomere
deprotection. To examine whether spontaneous mitotic arrest in
pre-crisis cells also induces telomere deprotection we used premature
sister separation, either cause or consequence of mitotic arrest, as
a marker of prolonged mitosis. Indeed, cells in pre-crisis displayed
Figure 3 | Cell fate decision during telomere crisis. a, Mean percentage ± s.d.
of metaphases with separated chromatids or diplochromosomes in IMR-90
E6E7 cells (n = 3, >100 metaphases per experiment). For one-way ANOVA
separated chromatids, P < 0.0001; diplochromosomes, not significant. Right,
representative images. Scale bar, 10 µm. b, c, Meta-TIF analysis of pre-crisis
IMR-90 E6E7 cells. Scale bar, 10 µm. Scatter plots show mean telomeric
γ-H2AX foci ± s.e.m. on attached chromatids (n = 67) or separated
increased premature chromatid separation (Fig. 3a). Telomeric
γ-H2AX foci analysis (metaphase telomere-dysfunction-induced
foci, meta-TIF) revealed that metaphase spreads with separated
chromatids displayed increased telomere deprotection (Fig. 3b, c
and Extended Data Fig. 5a). Accordingly, suppression of TRF2 in
young IMR-90 E6E7 cells not only caused fusion and prolonged mitosis
(Fig. 2a, b), but also increased premature separation (Extended
Data Fig. 5b).

Cells exhibiting premature separation were mostly near-diploid, 
excluding the possibility that they have an increased number of telo- 
meric ends (Extended Data Fig. 5c). We conclude that prolonged 
mitosis in pre-crisis is associated with a telomere DDR, and that 
mitotic arrest occurs in near-diploid cells. Consistently the ratio of 
tetraploid cells in pre-crisis cultures did not increase as prominently 
as the percentage of cells undergoing mitotic arrest (Extended Data 
Fig. 5d, e). We also rarely observed diplochromosomes, which are a 
consequence of two rounds of DNA replication without mitosis 
(endoreduplication) and thereby tetraploid (Fig. 3a). 

Next we asked how prolonged mitotic arrest affects cellular fate 
in pre-crisis. Live cell imaging indicated three potential outcomes: 
cytokinesis, slippage (mitotic exit without cytokinesis), and cell death 
(Fig. 3d). Mitotic duration correlated with cellular fate, as cells that
spent the least amount of time in mitosis underwent cytokinesis, and 
cells that resided longer in mitosis tended to slip or die (Fig. 3e). The 
cell death ratio increased significantly from 19% in cells arrested for 
2-6 h to 50% in cells arrested for 6-10 h and to 69% in cells arrested 
for >10 h (Fig. 3f). 

However, we noted that cells also died during interphase. Live- 
cell imaging revealed that 32% (86/266) of deaths occurred after 
prolonged mitosis (>2 hours), 14% (38/266) of cells died during 
mitosis lasting <2 hours and 53% (142/266) of cells died in inter- 
phase (Fig. 3g). To address whether cells that died during a short 
mitosis or interphase were associated with prolonged mitosis in the 
previous cell cycle, we traced the cells in question to the mitosis 
before death. Of the cells that succumbed in interphase, 46% of the
traceable previous mitoses were prolonged (18/39) (Fig. 3g). Of the
cells that died during a short mitosis, 29% of the previous mitoses
were prolonged (4/14) (Fig. 3g), indicating that cell death during
either a short mitosis or interphase follows prolonged mitosis in
the previous cycle.
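
The percentages quoted above follow directly from the reported counts. As a minimal sketch (the variable names are illustrative and not taken from the paper), they can be recomputed as follows:

```python
# Death counts reported in the text (live-cell imaging, 266 deaths in total).
death_counts = {
    "after prolonged mitosis (>2 h)": 86,
    "during short mitosis (<2 h)": 38,
    "during interphase": 142,
}
total_deaths = sum(death_counts.values())

# Share of each category among all observed deaths, in percent.
percentages = {label: 100 * n / total_deaths for label, n in death_counts.items()}

# Share of traceable previous mitoses that were prolonged, in percent.
prev_prolonged_interphase = 100 * 18 / 39
prev_prolonged_short_mitosis = 100 * 4 / 14

for label, pct in percentages.items():
    print(f"{label}: {pct:.1f}%")
print(f"previous mitosis prolonged (interphase deaths): {prev_prolonged_interphase:.1f}%")
print(f"previous mitosis prolonged (short-mitosis deaths): {prev_prolonged_short_mitosis:.1f}%")
```

Rounded to whole numbers, these values agree with the figures given in the text.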

Partial knockdown of TRF2 exacerbates mitotic telomere deprotection
in young IMR-90 E6E7 cells. To understand whether mitotic
telomere deprotection is the cause of death in pre-crisis, we tested
whether a partial knockdown of TRF2 would enhance cell death upon
spontaneous mitotic arrest. IMR-90 E6E7 cells were transduced with
shTRF2-F at PD45, PD70 or PD85 (Extended Data Fig. 6a). The
resulting partial suppression of TRF2 did not affect cell growth
dynamics in the younger populations, but slowed down growth in
the population infected at PD85 (Fig. 4a–c). However, in all settings,
the cells entered crisis prematurely (Fig. 4a–c). γ-H2AX foci analysis
around PD100 revealed that shTRF2-F cells suffer from increased
numbers of TIF, which was greatly enhanced on separated chromatids
(Fig. 4d and Extended Data Fig. 6b, c). Accordingly, live-cell imaging
revealed that shTRF2-F expression increased the cell death ratio, especially
after a short period of mitotic arrest (Fig. 4e). In contrast, TRF2
overexpression (Extended Data Fig. 6d), which partially suppresses mitotic
telomere deprotection, reduced cell death after mitotic arrest,
suppressed TIF and delayed crisis (Extended Data Fig. 6e-g). Neither
shTRF2-F nor TRF2 overexpression affected mitotic duration
(Extended Data Fig. 6h, i). shTRF2-F also did not increase fusion
frequency (Extended Data Fig. 6j), thereby attributing the increased
cell death to exaggerated loss of TRF2, not to increased fusion
formation. Accordingly, loss of TRF2 in cells lacking 53BP1 significantly
increased cell death and shortened the mitotic duration before cell
death upon colcemid-induced mitotic arrest (Extended Data Figs 3d
and 7a, b). Furthermore, inhibition of Aurora B kinase by hesperadin,
which suppresses mitotic telomere deprotection (Extended Data
Fig. 7c), greatly reduced cell death upon colcemid exposure
(Extended Data Fig. 7d, e), supporting a model in which amplified
telomere deprotection induced by mitotic arrest triggers cell death.

This model predicts that shTRF2-F would sensitize cells to drugs 
that induce mitotic arrest. Therefore HT1080 6TG cells expressing 
shScramble or shTRF2-F were exposed to Taxol, vinblastine, dimethylenastron
and, as a control, the topoisomerase I inhibitor camptothecin.
Expression of shTRF2-F significantly sensitized HT1080 6TG cells to
Taxol, vinblastine and dimethylenastron, all of which induce mitotic
arrest (Fig. 4f). No such effect was observed upon exposure to camp- 
tothecin (Fig. 4f), confirming that TRF2 levels affect cellular fate specif- 
ically upon mitotic arrest. 

Here we show that chromosome end-to-end fusions during crisis 
cause spontaneous mitotic arrest, amplifying telomere deprotection, 
which determines cellular fate. We suggest that telomere deprotection 
upon spontaneous mitotic arrest is the underlying molecular signal 
that leads to cell death in crisis (Fig. 4g). While we cannot rule out 
a role of fusion breakage cycles and the resulting chromosomal
abnormalities as a cause of death, several observations argue against
it. Only a few fusions are observed in crisis cells (Fig. 2a), which is
enough to trigger prolonged mitosis (Fig. 2b). Disruption of TRF2
leads to the rapid onset of prolonged mitosis, arguing against long- 
term effects of fusion breakage cycles (Fig. 2a). Cells succumb to 
death in the first prolonged mitosis, or in the following cell cycle 
(Fig. 3f, g), ruling out a long-term process. Increasing damage signals 
without increasing fusion frequency causes more death (Fig. 4e and 
Extended Data Fig. 7a), arguing for signalling from deprotected telo- 
meres as the cause for death. As most cell death during pre-crisis is 
associated with mitotic arrest, we propose that prolonged mitosis is the 
main mechanism that limits cellular lifespan upon bypass of sen- 
escence. These findings might also offer a clinical opportunity, since 
exacerbation of mitotic telomere deprotection sensitizes cancer cells to 
mitotic drugs. Mitotic arrest, however, has also been associated with 
tumorigenesis in checkpoint-compromised cells. Similarly, bone
marrow failure and cancer in individuals with telomeropathies are
frequent, which could potentially be explained by mitotic arrest
resulting from overly short telomeres. Telomere-driven spontaneous
mitotic arrest and the resulting mitotic telomere deprotection in 
the pre-crisis stage may thus function as a double-edged sword, 
explaining both cell death and chromosome instability upon bypass 
of senescence. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS

Cell culture and treatment. Human IMR-90 primary lung fibroblasts (ATCC) 
and their derivatives were grown in Glutamax-DMEM (Gibco) supplemented 
with 0.1 mM nonessential amino acids and 15% fetal bovine serum. HT1080 
6TG (ATCC) cells were grown in Glutamax-DMEM supplemented with 0.1 mM 
nonessential amino acids and 10% bovine growth serum. All cells were grown at 
7.5% CO₂ and 3% O₂. Colcemid (Life Technologies), Taxol (A. G. Scientific),
vinblastine (A. G. Scientific), dimethylenastron (A. G. Scientific), hesperadin
(Selleck), reversine (Selleck) and camptothecin (Selleck) were used at the indicated
concentrations. ATM inhibitor (KU-55933) (Tocris) was used at 10 µM. FACS
analysis was performed as described. Cells were tested for mycoplasma
contamination and found negative.

Live-cell imaging. Live-cell imaging was performed in 8-well microslide
chambers (iBidi) on a Zeiss inverted fluorescent microscope with a 20× 0.8 NA air
objective at 37 °C and 7.5% CO₂ (XLmulti S1 module, Zeiss). Images were
captured with an AxioCam MRm (Zeiss) using AxioVision software (Zeiss) typically
every 6 minutes for at least 48 h. Mitotic duration was defined as movie frames 
from nuclear envelope breakdown or a previous frame of cell rounding to a frame 
of cytokinesis, slippage or mitotic cell death. Prolonged mitosis was defined as a 
mitosis that continues for more than 2 h. Cells that escaped from a movie screen 
during prolonged mitosis were included in the mitotic duration analysis but 
excluded from the cell fate analysis. 
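
The frame-based definition above can be sketched in a few lines of Python. This is a hypothetical helper and not the authors' analysis code: the function names are invented for illustration, and it assumes that duration is counted as the number of frame intervals between the first and last mitotic frame, using the typical 6-minute capture interval.

```python
FRAME_INTERVAL_MIN = 6          # typical capture interval stated in the Methods
PROLONGED_THRESHOLD_MIN = 120   # prolonged mitosis: more than 2 h


def mitotic_duration_min(start_frame, end_frame, interval_min=FRAME_INTERVAL_MIN):
    """Minutes from the entry frame (nuclear envelope breakdown or rounding)
    to the exit frame (cytokinesis, slippage or mitotic cell death)."""
    if end_frame < start_frame:
        raise ValueError("end_frame must not precede start_frame")
    return (end_frame - start_frame) * interval_min


def is_prolonged(duration_min, threshold_min=PROLONGED_THRESHOLD_MIN):
    """True only when mitosis lasts strictly longer than the threshold."""
    return duration_min > threshold_min


# Illustrative frame indices (not data from the paper).
example_duration = mitotic_duration_min(start_frame=10, end_frame=35)
print(example_duration, is_prolonged(example_duration))
```

With this convention, a mitosis spanning exactly 20 intervals (120 min) is not scored as prolonged, because the definition requires more than 2 h.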

For shTRF2-F, image capture was started 5 days after infection. For sgEMPTY
(guide RNA without target), sgTRF2-1 and sgTRF2-2, image capture was started
7 days after infection. Where indicated, population doubling is the one at their 
seeding, typically 1 day before the starting date of image capture. Typically more 
than two independent movies were analysed to confirm reproducibility except for 
Fig. 1b, where one movie was analysed per data point. Where indicated, Taxol, 
colcemid and hesperadin were added to the culture right before starting live 
imaging. 

Vectors and viral infections. Target sequences of CRISPR/Cas9 are as follows:
sgTRF2-1, 5'-ACTGCATAACCCGCAGCAAT-(PAM)-3';
sgTRF2-2, 5'-TGTCTGTCGCGGATTGAAGA-(PAM)-3';
sg53BP1, 5'-CAGAATCATCCTCTAGAACC-(PAM)-3';
and sgLIG4, 5'-TGGCGTCGAAACATACTGAG-(PAM)-3'.
Target sequences of sg53BP1 and sgLIG4 were first cloned into LentiCRISPR
vector (Addgene plasmid 49535), followed by recloning of the guide RNA
expression cassette (U6 promoter, target sequence and gRNA scaffold) into the NheI site of
LentiCas9-Blast vector (Addgene plasmid 52962). BsmBI-digested LentiCRISPR
vector and LentiGuide-puro (Addgene plasmid 52963) were incubated with T4
PNK (NEB), Klenow fragment (NEB), and then T4 DNA ligase (NEB) to generate
LentiCRISPR-sgEMPTY and LentiGuide-sgEMPTY, respectively. Silent mutations
that pWZL-TRF2*"%A carries are: 5'-GCTCCICAGAGIGATGCAAT-3’ for 
sgTRF2-1; and 5’-TGCCTCAGCAGAATCGAGGA-3’ for sgTRF2-2. 

IMR-90 cells were infected with pLXSN3-HPV16E6, pLXSN3-HPV16E7, 
pLXSN3-HPV16E6E7, pLXSN3-p53dd, and pBabe-hTert retroviral vectors as 
described, and subjected to long-term culturing in the presence of 600 µg ml⁻¹
G418 (Mediatech, Inc.) (for pLXSN3) or 2 µg ml⁻¹ puromycin (Mediatech, Inc.)
(for pBabe). TRF2*8®N*- expressing IMR-90 E6E7 cells were selected in the pres- 
ence of 100 ng ml~' hygromycin (Mediatech, Inc.) for 6 days and subjected to 
LentiCRISPR infection. 

Lentiviral vectors pseudotyped with VSV glycoprotein were generated by 
the Salk Institute Gene Transfer, Targeting and Therapeutics (GT3) Core using 
a modified protocol’®. For shScramble (Addgene plasmid 1864), shTRF2-F, 
LentiCRISPR-sgEMPTY, LentiCRISPR-sgTRF2-1, LentiCRISPR-sgTRF2-2, 
LentiCas9-Blast, LentiCas9-Blast-sg53BP1, LentiCas9-Blast-sgLIG4, LentiGuide- 
sgEMPTY, and LentiGuide-sgTRF2-2, cells were plated in growth media
containing 4 µg ml⁻¹ polybrene and lentivirus and cultured for 2 days. Puromycin and
blasticidin were added to the culture at 2 µg ml⁻¹ and 10 µg ml⁻¹, respectively,
and infected cells were selected for more than 3 days before analysis. Where
indicated, ATMi was added 4 days after infection and medium was refreshed every
1-2 days. For fusion assay, cells were harvested 7 days after infection or at
indicated PD.

Antibodies. Primary antibodies: anti-γ-H2AX (613402 clone 2F3, Biolegend);
anti-TRF1 (Karlseder lab); anti-TRF2 (Karlseder lab); anti-53BP1 (H-300)
(sc-22760, Santa Cruz); anti-LIG4 (ab80514, Abcam); anti-pericentrin (ab4448,
Abcam); anti-MPM-2 (05-368MG, Millipore); anti-γ-tubulin (T6557,
Sigma-Aldrich); anti-H3S10P (D2C8) (3377, Cell Signaling); anti-GAPDH
(A300-641A, Bethyl); anti-p53 (sc-126, Santa Cruz).

Secondary antibodies: HRP-linked anti-mouse or anti-rabbit (NXA931 or
NA934V; GE Healthcare); Alexa488-conjugated anti-rabbit (Invitrogen);
Alexa594-conjugated anti-mouse (Invitrogen).

Western blotting. Western blots were performed as described previously.

Immunofluorescence and telomere-centromere FISH on metaphase spreads.
Cytocentrifugation, immunofluorescence and telomeric FISH were performed as
described previously. For telomere and centromere double-staining,
Alexa488-conjugated telomeric PNA probe (TelC-A488, PNA Bio Inc.) and Cy3-conjugated
centromeric PNA probe (CENT-Cy3, PNA Bio Inc.) were used. Percentage of
telomere fusion per chromosome end was analysed as described. For pericentrin
foci analysis, cells in early mitosis (pro-, prometa- and metaphase) were selected
according to chromosome shape from MPM-2-positive cells. For chromosome
alignment analysis, cells in which most H3S10P-positive chromosomes align
between (and do not overlap with) two γ-tubulin foci were selected as metaphase.
Cells were analysed 7 days after infection or at indicated PD.

Telomere blots. Double-stranded telomere analysis was performed as described
previously.

Viability assay. Premixed WST-1 Cell Proliferation Reagent (Clontech) was used
for viability assay according to the manufacturer's instructions. HT1080 6TG cells
infected with either shScramble or shTRF2 were seeded in 96-well plates at 6 days
post infection and exposed to Taxol, vinblastine, dimethylenastron and
camptothecin at 7 days post infection for 48 h. Triplicate wells were analysed for each
drug concentration. The results were reproduced in three independent
experiments. The logIC₅₀ value was analysed by the log(inhibitor) versus normalized
response (variable slope) method using Prism 6 software.
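
For orientation, the log(inhibitor) versus normalized response (variable slope) model is commonly written as Y = 100 / (1 + 10^((logIC₅₀ − X) × HillSlope)), where X is the log₁₀ drug concentration. The sketch below evaluates that model and estimates its two parameters by a coarse least-squares grid search. The grid search is a simple stand-in for the nonlinear regression that Prism performs, the function names are hypothetical, and the data are synthetic values generated from the model itself rather than measurements from this study.

```python
def normalized_response(log_conc, log_ic50, hill_slope):
    """Normalized response (0-100%) at log10 concentration log_conc."""
    return 100.0 / (1.0 + 10.0 ** ((log_ic50 - log_conc) * hill_slope))


def fit_by_grid_search(log_concs, responses):
    """Least-squares estimate of (logIC50, Hill slope) over a coarse grid.

    Assumption: an inhibitory curve, so only negative slopes are searched.
    """
    log_ic50_grid = [x / 100.0 for x in range(-1200, -299)]  # -12.00 ... -3.00
    slope_grid = [-s / 10.0 for s in range(3, 31)]           # -0.3 ... -3.0

    def sse(params):
        log_ic50, slope = params
        return sum(
            (normalized_response(x, log_ic50, slope) - y) ** 2
            for x, y in zip(log_concs, responses)
        )

    return min(((c, s) for c in log_ic50_grid for s in slope_grid), key=sse)


# Synthetic, illustrative dose-response points (not data from the paper).
log_concs = [-10.0, -9.0, -8.0, -7.0, -6.0, -5.0]
responses = [normalized_response(x, -7.5, -1.2) for x in log_concs]

best_log_ic50, best_slope = fit_by_grid_search(log_concs, responses)
print(best_log_ic50, best_slope)
```

By construction the response is 50% when X equals logIC₅₀, which is why a shift of the fitted logIC₅₀ between shScramble and shTRF2-F curves can be read as a change in drug sensitivity.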

Statistical methods. Each figure legend shows the number of samples per 
experiment and number of experiments that were analysed independently. 
No statistical methods were used to predetermine sample size. Two-tailed 
unpaired t-tests and two-tailed Mann-Whitney tests were used to compare
two data sets, where Gaussian distribution is assumed and not assumed, 
respectively. To detect any trends among multiple data sets in Fig. 3a and 
Extended Data Figs 5a and 7c, one-way ANOVA was used. Mitotic duration 
data sets of IMR-90 E6E7 cells in Fig. 1b were analysed with Kruskal-Wallis 
tests to detect trends (P<0.0001), in addition to Mann-Whitney tests, as 
described above. For statistical analysis of cellular fate after mitotic arrest, 
the ratio of death versus non-death (cytokinesis and slippage) was analysed 
with a two-tailed Fisher’s exact test. For Fig. 3f, data from short (2-6 h) and 
middle (6-10 h) mitotic arrest were combined and compared to that of long 
mitotic arrest (>10 h). The null hypothesis was rejected when P values were 
less than 0.05. No randomization was performed. The investigators were not 
blinded to allocation during experiments and outcome assessment. All statist- 
ical analysis was performed using Prism 6 software. 
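
To illustrate the death versus non-death comparison, the sketch below implements a two-tailed Fisher's exact test for a 2×2 table in plain Python. It is an illustrative implementation, not the Prism routine used by the authors: it sums the hypergeometric probabilities of all tables that are no more likely than the observed one, which is one common definition of the two-tailed P value, and the example counts are hypothetical.

```python
from math import comb


def fisher_exact_two_tailed(a, b, c, d):
    """Two-tailed Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows could be two conditions and columns death / non-death.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Hypergeometric probability of x events in row 1 with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Small relative tolerance so ties with the observed table are included.
    p_value = sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


# Hypothetical counts: death vs non-death in two groups of cells.
p = fisher_exact_two_tailed(8, 2, 1, 5)
print(f"P = {p:.4f}, reject null at 0.05: {p < 0.05}")
```

For the pooled comparison described for Fig. 3f, the first row would hold the combined 2-6 h and 6-10 h counts and the second row the >10 h counts.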
Extended Data Figure 1 | Mitotic duration in pre-crisis cells is elongated.
a, Percentage of cells that spend more than 2 h in mitosis (prolonged mitosis),
shown in Fig. 1b. b, Effect of indicated oncogenes on p53 expression. c, Scatter
plots show mean mitotic duration ± s.e.m. of individual IMR-90 derivative
cells analysed in Fig. 1d. d, Scatter plots with bars show mean percentage of
prolonged mitosis analysed in Fig. 1d (1-4 independent experiments). The
dots represent independent experiments and the bars the mean. e, Scatter
plots show mean (± s.e.m.) mitotic duration of IMR-90 E6E7 PD95 cells
exposed to reversine and hesperadin (***P < 0.0001, Mann-Whitney test).
The result was reproduced in two independent experiments.
Extended Data Figure 2 | Telomere elongation abrogates extended mitotic
duration. a, Telomere elongation by hTert in IMR-90 E6E7 and p53dd cells
shown in Fig. 1e, f. IMR-90 E6E7 and p53dd cells were infected at PD73 and
PD82, respectively, and analysed at the indicated PD. b, c, Scatter plots show
mean mitotic duration ± s.e.m. of individual IMR-90 E6E7 (b) and p53dd (c)
cells expressing hTert at indicated PD shown in Fig. 1e, f. d, Effect of sgTRF2
on TRF2 expression 7 days after infection. e, f, Scatter plots with bars show
mean percentage of cells with telomeric fusion (e) and prolonged mitosis (f) in
IMR-90 E6E7 derivatives shown in Fig. 2a, b (1-3 independent experiments).
***P < 0.0001, Mann-Whitney test.
Extended Data Figure 3 | Extended mitotic duration depends on telomere
fusion. a, Effect of sgTRF2 on TRF2 expression in cells expressing
sgTRF2-resistant TRF2 (TRF2*"™*) 9 days after infection with CRISPR/Cas9.
b, c, Scatter plots with bars show mean percentage of cells with telomeric fusion
(b) and prolonged mitosis (c) in IMR-90 E6E7 expressing sgTRF2 in the
presence of TRF2**88N4 shown in Fig. 2c, d (two independent experiments).
d, Schematic of 53BP1 or ligase IV suppression experiment in the presence of
sgEMPTY or sgTRF2-2. e, Western analysis of IMR90 E6E7 cells expressing
Cas9, Cas9-sg53BP1 or Cas9-sgLig4 in the background of sgEMPTY or
sgTRF2-2. f, Representative meta-TIF images of cells suppressed for 53BP1
or LIG4 in the presence of sgEMPTY or sgTRF2-2 as described in d.
g, h, Percentage of telomeric fusion (g) and mitotic duration (h) in IMR-90
E6E7 expressing sgEMPTY and sgTRF2 in the presence of DMSO or ATM
inhibitor (mean ± s.e.m.). i, j, Scatter plots with bars showing mean percentage
of cells with telomeric fusion (i) and prolonged mitosis (j) in IMR-90 E6E7
expressing sgTRF2 in the presence of ATM inhibitor shown in g, h (two
independent experiments). k, Mean mitotic duration ± s.e.m. of IMR-90 E6E7
sgEMPTY cells exposed to 500 nM Taxol in the presence of DMSO or ATMi.
l, Scatter plots show mean mitotic duration of individual cells shown in
j. NS, not significant; *P < 0.05, **P < 0.005, ***P < 0.0001, Mann-Whitney
tests. Results were reproduced in at least two independent experiments.


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Figure panel labels: anaphase chromosome, no bridge versus bridge (sgEMPTY, n = 54; sgTRF2-2, n = 21); pericentrin foci in pro-, prometa- and metaphase (sgEMPTY, n = 283; sgTRF2-2, n = 418); metaphase chromosome, aligned versus unaligned (sgEMPTY, n = 97; sgTRF2-2, n = 107); IMR90 E6E7, 7 days post infection. Image channels: pericentrin, MPM-2, DAPI; H3S10P, γ-tubulin; merge.]
Extended Data Figure 4 | Telomere fusions lead to multipolar mitosis.
a-c, Ratio of anaphase chromosome with or without anaphase bridge
(a), pericentrin foci in pro-, prometa- and metaphase (b) and metaphase
chromosome with or without unaligned chromosome (c) in IMR-90 E6E7
expressing sgEMPTY and sgTRF2-2 7 days after infection (Fisher’s exact test,
for pericentrin foci, 1 and 2 foci versus ≥3 foci). Representative images from
sgEMPTY cells are shown below (b, c). Results were reproduced in two
independent experiments. d, Scatter plots with bars show mean percentage of
cells that possess tetraploidy (FACS analysis) and multipolarity (≥3 pericentrin
foci as in j) in IMR-90 E6E7 cells expressing sgEMPTY and sgTRF2-2 7 days
after infection (two independent experiments). **P < 0.005, ***P < 0.0001.
Fisher’s exact test. Scale bars, 10 µm.
Extended Data Figure 5 | Mitotic arrest occurs in near-diploid cells. a, Bars
show mean of three independent experiments of average number (± s.e.m.)
of telomeric γ-H2AX foci in IMR-90 E6E7 at PD42, 72 and 105 as analysed in
Fig. 4d (n = 25 per experiment, P = 0.0008, one-way ANOVA). b, Bars show
the percentage of young IMR90 E6E7 cells with separated sister chromatids
expressing sgEMPTY or sgTRF2-1/2 6 or 7 days post infection with
CRISPR/Cas9. Results from two independent experiments except for sgEMPTY 6 days
post infection (at least 31 metaphases per experiment) are shown. c, Ratio of
ploidy of mitotic IMR-90 E6E7 cells at PD106. Cells with attached and
separated sister chromatids are plotted separately. d, FACS analysis of cells
shown in Fig. 1b. e, Bars show percentage of prolonged mitosis and tetraploid
cells (4N) in IMR-90 E6E7 cells at indicated PD. Percentages of prolonged
mitosis are the same data sets as in Extended Data Fig. 1a.
Extended Data Figure 6 | TRF2 suppression and overexpression affects cell
fate. a, Effect of shTRF2-F on TRF2 expression 7 days after infection.
b, c, Telomeric and non-telomeric γ-H2AX foci in individual pre-crisis IMR-90
E6E7 cells expressing shScramble and shTRF2-F shown in Fig. 4b, c. Data in
b and c were analysed as in Fig. 4d (n > 16, mean ± s.e.m.). Metaphases
with separated chromatids are shown in magenta. d, Effect of Myc-TRF2 on
TRF2 expression 7 days after infection. e, Ratio of mitotic cell fate in pre-crisis
IMR-90 E6E7-expressing TRF2 cells, analysed as in Fig. 3f (Fisher’s exact
test, death versus non-death). f, Meta-TIF analysis of pre-crisis IMR-90 E6E7
cells expressing a control vector or Myc-TRF2. Bars show mean (± s.e.m.)
telomeric and non-telomeric γ-H2AX foci (n = 25, Mann-Whitney tests).
g, Growth curve of IMR-90 E6E7 cells expressing a control vector or infected
with Myc-TRF2-F at PD90. h, i, Mean (± s.e.m.) mitotic duration of IMR-90
E6E7 cells expressing shScramble and shTRF2-F (h) and Myc-TRF2
(i) analysed in Fig. 4e and Extended Data Fig. 6e, respectively (Mann-Whitney
test). j, Mean (± s.e.m.) percentage of telomere fusion of pre-crisis IMR-90
E6E7 expressing shScramble and shTRF2-F analysed in Fig. 4a–c
(Mann-Whitney test). *P < 0.05, **P < 0.005, ***P < 0.0001. NS, not
significant.
[Figure panel labels: mitotic duration to mitotic cell death (min), IMR90 E6E7 Cas9-sg53BP1; slippage versus death for col + DMSO and col + hesp at 2-6 h (n = 59 and n = 73), 6-10 h (n = 76 and n = 58) and >10 h (n = 68 and n = 30).]

Extended Data Figure 7 | Amplified telomere deprotection during mitotic
arrest in crisis causes cell death. a, Ratio of mitotic slippage and cell death in
IMR-90 E6E7 Cas9-sg53BP1 cells expressing sgEMPTY and sgTRF2-2 in the
presence of colcemid, analysed as in Fig. 3f (Fisher’s exact test, death versus
slippage). b, Scatter plots show mean mitotic duration ± s.e.m. before cell death
of individual IMR-90 E6E7 Cas9-sg53BP1 cells expressing sgEMPTY and
sgTRF2-2 in the presence of colcemid (Mann-Whitney test). c, Bars show
mean (± s.d.) of three independent experiments of average telomeric and
non-telomeric γ-H2AX foci in IMR-90 E6E7 cells at PD45 exposed to colcemid in
the presence of DMSO or hesperadin at indicated concentrations for 24 h
analysed as in Fig. 4d (50 metaphases per experiment). For one-way ANOVA,
telomeric foci, P < 0.0001; non-telomeric foci, not significant. d, Scatter plots
show mean (± s.e.m.) mitotic duration of IMR-90 E6E7 cells at PD27 exposed
to 100 ng ml⁻¹ colcemid in the presence of DMSO or 40 nM hesperadin
(Mann-Whitney test). e, Ratio of mitotic cell fate in IMR-90 E6E7 cells
around PD45 exposed to 100 ng ml⁻¹ colcemid in the presence of DMSO or
40 ng ml⁻¹ hesperadin (Fisher’s exact test, death versus slippage). **P < 0.005,
***P < 0.0001. NS, not significant.
New cofactor supports α,β-unsaturated acid
decarboxylation via 1,3-dipolar cycloaddition


Karl A. P. Payne, Mark D. White, Karl Fisher, Basile Khara, Samuel S. Bailey, David Parker, Nicholas J. W. Rattray,
Drupad K. Trivedi, Royston Goodacre, Rebecca Beveridge, Perdita Barran, Stephen E. J. Rigby, Nigel S. Scrutton, Sam Hay
& David Leys

The bacterial ubiD and ubiX or the homologous fungal fdc1 and
pad1 genes have been implicated in the non-oxidative reversible
decarboxylation of aromatic substrates, and play a pivotal role in
bacterial ubiquinone (also known as coenzyme Q) biosynthesis
or microbial biodegradation of aromatic compounds, respectively.
Despite biochemical studies on individual gene products, the
composition and cofactor requirement of the enzyme responsible
for in vivo decarboxylase activity remained unclear. Here we
show that Fdc1 is solely responsible for the reversible decarboxylase
activity, and that it requires a new type of cofactor: a prenylated
flavin synthesized by the associated UbiX/Pad1. Atomic resolution
crystal structures reveal that two distinct isomers of the oxidized
cofactor can be observed, an isoalloxazine N5-iminium
adduct and a N5 secondary ketimine species with markedly altered
ring structure, both having azomethine ylide character. Substrate
binding positions the dipolarophile enoic acid group directly
above the azomethine ylide group. The structure of a covalent
inhibitor-cofactor adduct suggests that 1,3-dipolar cycloaddition
chemistry supports reversible decarboxylation in these enzymes.
Although 1,3-dipolar cycloaddition is commonly used in organic
chemistry, we propose that this presents the first example, to
our knowledge, of an enzymatic 1,3-dipolar cycloaddition reaction.
Our model for Fdc1/UbiD catalysis offers new routes in
alkene hydrocarbon production or aryl (de)carboxylation.

Decarboxylation is one of the most common reactions in
nature, despite the fact it is inherently difficult to achieve at ambient
conditions. This is due to the high energy of the carbanion intermediate
that is formed concomitant with carbon dioxide formation. To
overcome this challenge, the majority of decarboxylases make use
of cofactors, including organic cofactors such as flavins, pyridoxal
phosphate (PLP), thiamine pyrophosphate (TPP) or metal ions such
as Mg²⁺, Fe²⁺ or Mn²⁺. Decarboxylation is also frequently coupled to
substrate oxidation. Comparatively few decarboxylases have been
shown to require no cofactor and in these selected cases catalysis
involves non-oxidative decarboxylation of those substrates for which
the corresponding carbanion species can be stabilized using simple
acid-base chemistry. Examples include orotidine monophosphate
decarboxylase and arylmalonate decarboxylase.

The ubiX and ubiD or the related pad1 and fdc1 genes have been
shown to be responsible for non-oxidative reversible decarboxylation
of aromatic substrates (Fig. 1a). These genes are widely distributed
in archaeal, bacterial and fungal genomes, and ubiX and ubiD feature
in the prokaryotic ubiquinone biosynthetic pathway. Genetic studies
have led to the suggestion that both genes encode (redundant)
decarboxylases. However, although UbiX/Pad1 proteins are distantly
related to flavin-containing cysteine decarboxylases and have been
shown to bind FMN, no in vitro decarboxylase activity has been
detected. Furthermore, most biochemical studies of UbiD/Fdc1 have
also failed to detect in vitro activity. Although recent UbiD/Fdc1
structures reveal distant structural homology to a family of
NADH-FMN oxidoreductases, only metal ion binding rather than flavin
binding has been reported. Hence, the composition and putative cofactor
requirement of the enzyme responsible for the observed in vivo
decarboxylase activity remained unclear.

We co-expressed the Aspergillus niger fdc1 gene in E. coli with either
the associated A. niger pad1 or the homologous ubiX from E. coli.
Although no direct interaction could be detected between the purified
Fdc1 and either UbiX or Pad1 proteins, clear differences could be
observed between singly expressed Fdc1 and the corresponding ubiX
or pad1 co-expressed Fdc1 protein (denoted Fdc1^UbiX). Purified
Fdc1^UbiX has distinct spectral properties (Fig. 1b) and catalyses
reversible decarboxylation of a wide range of aromatic carboxylic acids in
vitro (Fig. 1c, d and Extended Data Fig. 1). Similar observations were
made for Saccharomyces cerevisiae Fdc1^UbiX, although in this case the
corresponding Fdc1 singly expressed protein (but not Fdc1^UbiX)
weakly binds FMN (Extended Data Fig. 2). During the review stage
for this publication, similar observations for the S. cerevisiae Fdc1 were
reported elsewhere, although the Fdc1 cofactor was not identified.
Crystal structures of A. niger, Candida dubliniensis and S. cerevisiae
Fdc1^UbiX reveal that a heavily modified FMN cofactor is bound by these
enzymes (Fig. 2 and Extended Data Fig. 3). The modified FMN
phosphate is bound in complex with metal ions, similar to the distantly
related FMN-binding protein from Methanobacterium
thermoautotrophicum. The electron density suggests both Mn²⁺ and K⁺ ions
are present, and Mn²⁺ binding can be detected using EPR (Extended
Data Fig. 4). Early studies on E. coli UbiD revealed activation by both
Mn²⁺ and an unidentified cofactor. The electron density for the
modified FMN reveals extensive modification has occurred at both
N5 and C6 positions, effectively adding a fourth ring to the
isoalloxazine moiety (Fig. 2b). The atomic resolution obtained for A. niger
Fdc1^UbiX allows identification of the cofactor structure as an
isopentenyl-adduct to the flavin N5-C6 or prenylated FMN (prFMN) (Fig. 2c).
The branched nature of the isopentenyl adduct and the position of the
covalent linkages with the flavin suggest this modification is achieved
through prenylation. In the accompanying paper, we show that UbiX
indeed acts as a flavin prenylase enzyme that can support Fdc1
activation in vitro. Activation is dependent on the presence of oxygen,
suggesting the reduced prFMN UbiX product is oxidized to the
corresponding flavo-N5 iminium adduct (prFMN^iminium; Fig. 2d). The
presence of the oxidized prFMN was confirmed by high-resolution
mass spectrometry of both the isolated cofactor and native mass
spectrometry of the Fdc1^UbiX protein (Extended Data Fig. 5).
Furthermore, complete inactivation of Fdc1^UbiX was achieved using
the mild reductant sodium cyanoborohydride. Upon reoxidation of
the sodium cyanoborohydride inactivated enzyme, a radical species
was formed with spectral and EPR properties similar to that of the
prFMN radical species detected during non-physiological oxidation
of the UbiX-prFMN complex (Fig. 2e and Extended Data Fig. 4).


1Centre for Synthetic Biology of Fine and Speciality Chemicals, Manchester Institute of Biotechnology, University of Manchester, 131 Princess Street, Manchester M1 7DN, UK. 2Innovation/Biodomain, 
Shell International Exploration and Production, Westhollow Technology Center, 3333 Highway 6 South, Houston, Texas 77082-3101, USA. 
Figure 1 | Fdc1 solution data. a, Schematic overview of the reaction catalysed
by Fdc1/Pad1 and the related UbiD/UbiX proteins. b, UV-visible spectra
obtained for heterologous expressed A. niger Fdc1, with and without
co-expression of a ubiX gene. c, UV-visible observation of the enzymatic
conversion of cinnamic acid to styrene via decarboxylation by Fdc1^UbiX. The
initial spectrum of cinnamic acid shows a λmax of 270 nm. Over time,
successive spectra show reduction of the 270 nm peak and appearance of a
peak at 245 nm corresponding to styrene formation. d, Steady-state kinetic
parameters obtained for Fdc1^UbiX for sorbic acid and a variety of cinnamic
acid-type compounds (error bars are s.e.m., n = 3).


Furthermore, the amount of prFMN radical formed is proportional to
the levels of enzyme activity obtained before cyanoborohydride
inactivation (Fig. 2f), suggesting that enzyme inactivation occurs by
modification (probably hydrolysis) of the active prFMN species.

Marked heterogeneity of the prFMN cofactor is observed in the
A. niger Fdc1^UbiX wild-type crystal structures, with older crystal
samples revealing an increased population of a hydroxylated
prFMN^iminium. The latter species could also be observed by
high-resolution mass spectrometry (Extended Data Fig. 5a). Surprisingly,
a distinct prFMN isomer (prFMN^ketimine) can be observed for most
A. niger Fdc1 crystals, although it is not detected in the C. dubliniensis
or S. cerevisiae crystals (Fig. 3a). This form contains an altered
isoalloxazine ring structure, a likely consequence of isomerization of the
extended ring system during oxidative maturation of the prFMN
cofactor. We propose this isomerization step occurs during the stepwise
oxidation of the reduced prFMN UbiX product, via a radical
mechanism (Extended Data Fig. 6). The isomerization leads to insertion of
the prenyl C1' carbon into the flavin isoalloxazine ring, creating a
central seven-membered ring and converting the N5-iminium into
a secondary ketimine. The conformation derived from density
functional theory (DFT) calculations for the proposed prFMN^ketimine
species is in close agreement with that observed in the crystal structure.
The calculations indicate this species is 30-40 kJ mol⁻¹ lower in gas
phase free energy than the prFMN^iminium species and can exist in
two butterfly bent-like conformations of similar energy (Extended
Data Fig. 7).

To obtain insights into the catalytic mechanism, A. niger Fdc1^UbiX
crystals were soaked with a range of trans-cinnamic-acid-related com- 
pounds. The corresponding enzyme-substrate complexes clearly 


Figure 2 | Fdc1^UbiX crystal structures. a, Crystal structure of the A. niger
Fdc1^UbiX. The monomer present in the asymmetric unit is depicted in
cartoon format (helices in green, sheets in blue) while the symmetry related
monomer forming the Fdc1^UbiX dimer is shown in grey. b, Detailed view of
Mn²⁺ binding site, linking the modified FMN phosphate group to the
protein. c, Detailed view of the modified FMN cofactor. Omit electron
density map corresponding to the bound cofactor contoured at 5σ. A fourth
ring (non-isoalloxazine derived atoms are shown in green) can clearly be
observed. An asterisk indicates the position of additional weak electron
density at the C1 position, which can be accounted for by partial hydrolysis.
Key residues involved in polar interactions with the isoalloxazine derived
cofactor moiety are shown as sticks. d, Chemical structure of the modified
FMN bound by A. niger Fdc1^UbiX as derived from atomic resolution density
and high-resolution mass spectrometry (Extended Data Fig. 5). Atoms
derived from dimethylallyl-monophosphate are shown in red. The
azomethine ylide resonance form is shown below. e, Oxidation of
cyanoborohydride inactivated Fdc1^UbiX leads to formation of a purple
species, similar in UV-visible and EPR spectral properties (Extended Data
Fig. 4) to that observed for the oxidized UbiX-prFMN complex.
f, Following purification, Fdc1^UbiX activity gradually decreases when
incubated on ice, and formation of the purple prFMN radical following
NaBH3CN treatment follows a similar trend (error bars are s.d., n = 3).
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Figure 3 | Fdc1™ cofactor structure and ligand complexes. a, Omit 
electron density map corresponding to a distinct isomer of the prFMN 
contoured at 5c. An expansion of the central ring of the isoalloxazine system 
can clearly be observed, with the distinct butterfly bent conformation 
accompanied by altered conformation of the ribityl moiety. b, Detailed view of 
prFMN® isoforms in the Fdc1”>*-substrate complexes. The omit electron 
density maps corresponding respectively to the prFMN""" (in blue) and 


reveal substrates are stacked directly above the prFMN ring system
(Fig. 3b, c). Electron density for both the iminium and ketimine species
can be detected in the atomic resolution density maps, the latter adopting
a more planar conformation of the extended ring system in order to
accommodate substrate binding (Fig. 3b, c and Extended Data Fig. 7).
The substrate enoic acid double bond is positioned directly above
the C4a of both prFMN species. Although the aryl-group of all
substrates tested occupies a similar position directly above the prFMN
N1 atom, electron density corresponding to the carboxylate moiety
is distinct for each substrate (Fig. 3c). Although clear electron density
for the pentafluorocinnamic acid carboxylate is lacking, both
α-methylcinnamic acid and α-fluorocinnamic acid carboxylate groups
can be observed. Both occupy a position vacated by reorientation of
the conserved E282, and establish polar contacts with R173 and the
amide nitrogen of M283. In the case of α-methylcinnamic acid, the
carboxylate is within hydrogen bonding distance of the N5 secondary
ketimine group of the prFMN^ketimine form. A complex with the
decarboxylation product 4-vinyl guaiacol (a distinct flavouring agent
obtained by yeast-mediated decarboxylation of ferulic acid) reveals a
similar position for the aryl-moiety (Fig. 3c). The vinyl group is placed
directly above the prFMN C4a, in close proximity to electron density
resembling a bound CO2 molecule.

At first glance, the position of the substrate α,β-unsaturated carbonyl
directly above the prFMN C4a suggests the possibility of transient
formation of a prFMN C4a–substrate β-carbon bond reminiscent
of Michael addition-like chemistry and other flavin-catalysed
reactions. This mechanism has similarities to the amidohydrolase-type
decarboxylases, where nucleophilic attack by a metal-bound
hydroxide ion is postulated to lead to transient C=C bond cleavage
of an enoic acid moiety leading to decarboxylation. In the case of

prFMN^ketimine (in red) species each contoured at 5σ are shown for the
α-methyl-cinnamic acid complex. c, A series of Fdc1^UbiX substrate/product
complexes. Selected active site residues of Fdc1^UbiX are shown in atom colour
sticks, with the omit electron density contoured at 5σ corresponding to
α-methyl-cinnamic acid, pentafluorocinnamic acid, α-fluorocinnamic acid and
4-vinyl-guaiacol, respectively.


Fdc1, C=C bond cleavage should proceed with concomitant protonation
of the α-carbon to allow decarboxylation. Our structural data
suggest the N5 secondary ketimine of prFMN^ketimine could act as a
probable acid-base catalyst, providing a rationale for both FMN
modification and rearrangement (Fig. 4a and Extended Data Fig. 7b).
However, although our solution mass spectrometry and chemical
inhibition data cannot distinguish between the iminium and ketimine
cofactor isomers, solution data presented in Fig. 2e, f suggest
prFMN^iminium is the catalytically relevant species.

To determine which prFMN isomer supports catalysis by Fdc1
and provide further insight into the likely mechanism, we sought to
determine the structure of a covalent substrate-cofactor adduct. When
A. niger Fdc1^UbiX protein is incubated with phenylpyruvate, for which
the corresponding enol-tautomer α-hydroxycinnamic acid (present
as a minor population in solution) closely resembles the cinnamic
acid substrate, the UV-visible spectrum is altered (Fig. 4b). Similar
observations are made when incubating with phenylacetaldehyde.
Furthermore, incubation with phenylpyruvate or phenylacetaldehyde
leads to reversible enzyme inhibition, with a gradual increase of enzyme
activity upon removal of excess phenylpyruvate (Fig. 4b). The
corresponding crystal structure of A. niger Fdc1^UbiX incubated with
phenylpyruvate reveals a covalent adduct formed between the prenyl-C1′ of
the prFMN^iminium cofactor and a substrate-derived entity (Fig. 4c).
Atomic resolution electron density reveals the latter clearly
corresponds to a phenylacetaldehyde adduct, a species that can be formed
following decarboxylation of α-hydroxycinnamic acid and
tautomerisation of the corresponding α-hydroxystyrene prFMN adduct
(Fig. 4d, species IIIb). The reversible inhibition observed can be
accounted for by the fact that the corresponding enol-keto tautomer
equilibrium is in favour of the off-pathway phenylacetaldehyde adduct.

Figure 4 | Insights into the Fdc1^UbiX mechanism. a, Proposed mechanism for
Fdc1^UbiX catalysis using the prFMN ketimine form. b, Incubation of Fdc1
with phenylpyruvate leads to changes in the UV-visible spectrum. The inset
shows conversion of cinnamic acid to styrene (monitored at 270 nm) after
addition of phenylpyruvate treated Fdc1^UbiX. c, Active site of Fdc1^UbiX in

This observation further confirms that prFMN^iminium is the catalytically
relevant species, and the unexpected presence of a C1′–substrate
α-carbon bond suggests a distinct mechanism from that of the
amidohydrolase-type decarboxylases occurs (Fig. 4d). Fluorination of the
substrate β-carbon or the aromatic group has only a modest effect on
catalysis (Fig. 1d). Furthermore, simple enoic acids also act as
substrates, as prolonged incubation of Fdc1^UbiX with these substrates does
lead to formation of the corresponding terminal alkenes (α-olefins) at
modest levels (Extended Data Fig. 8). Both observations argue against
formation of transient ionic species.

The neutral form of the prFMN^iminium has distinct azomethine ylide
character (a well-known 1,3-dipole) that is positioned directly adjacent
to the substrate α,β-unsaturated carbonyl (a dipolarophile) by the
enzyme. Thus, we propose a 1,3-dipolar cycloaddition between the
substrate and the prFMN^iminium azomethine ylide form leads to a
transient covalent substrate-prFMN^iminium pyrrolidine adduct,
establishing covalent bonds with both C1′ and C4a (species II). Ample
precedent is available for the concerted 1,3-dipolar cycloaddition
between azomethine ylides and α,β-unsaturated carbonyls in organic
chemistry. Whether the proposed Fdc1-mediated cycloaddition

complex with a phenylpyruvate derived adduct. The omit electron density map
(contoured at 7σ, in blue) corresponding to the prFMN–phenylpyruvate
adduct is shown. Atoms derived from phenylpyruvate are depicted in green.
d, Proposed mechanism for Fdc1^UbiX catalysis and phenylacetaldehyde adduct
formation of the prFMN^iminium form.


occurs through a single pericyclic transition state cannot be
established at this stage. Fragmentative decarboxylation (Grob-type
fragmentation) of the pyrrolidine adduct II can occur coupled to
breaking the β-carbon–prFMN C4a bond. This leaves a single bond
connecting the substrate α-carbon with the C1′ of prFMN^iminium
(species IIIa), as observed in the phenylpyruvate adduct IIIb. We
propose protonation by E282 concomitant with formation of a second
pyrrolidine adduct (species V) leads to product formation (species VI)
via a retro 1,3-dipolar cycloaddition. Together with R173 and E277,
E282 forms a network of polar interactions that is conserved
throughout the UbiD/Fdc1 family. The R173A, E277Q and E282Q variants of
Fdc1^UbiX are all inactive, and display altered UV-visible spectrum
properties, suggesting these mutations also affect cofactor maturation
(Extended Data Fig. 9).

The modification of the flavin isoalloxazine ring through prenylation
at N5 and C6, followed by oxidation, strikingly alters and
expands the flavin chemical repertoire. Oxidation of the N5-prenyl
adduct bond leads either to the corresponding iminium adduct or,
when coupled to ring isomerization, a secondary ketimine adduct.
Both species have azomethine ylide character and distinct catalytic
potential. In the Fdc1/UbiD enzyme family, the prFMN^iminium form
supports reversible decarboxylation of a wide range of (aromatic)
substrates. Although the [3+2] reaction between azomethine
ylide dipoles and alkene dipolarophiles has been extensively used
in organic chemistry, the mechanism proposed here would
present the first example, to our knowledge, of a biological [3+2]
reaction. Combined with the recent description of both natural and
artificial bona fide [4+2] cycloaddition catalysing enzymes, this
hints at more widespread use of pericyclic reaction chemistry in
nature. As Fdc1/UbiD enzymes have evolved from an NADH-FMN
oxidoreductase module, the distinct possibility exists that other
unrelated prFMN-dependent enzymes might have developed from
distinct flavin-binding modules. Such distinct prFMN-dependent
enzymes could make use of different aspects of the prFMN chemistry,
as occurs with other organic cofactors. Artificial flavoenzymes
containing N5-alkylated flavins have been created that are capable of
H2O2-driven enantioselective sulfoxidations, using the fact that
N5-alkylated flavins can form remarkably stable 4a-peroxyflavins
and are powerful oxidising catalysts. It is possible similar enzymes
already exist in nature.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.
METHODS


Cloning. The S. cerevisiae, C. dubliniensis and A. niger fdc1 genes were codon
optimized and synthesized (Genscript). The A. niger fdc1 gene was cloned into the
NdeI and XhoI sites of pET30a and the S. cerevisiae and C. dubliniensis genes were
cloned into the NdeI and XhoI sites of pET21b. E. coli ubiX was cloned into the
NdeI and XhoI sites of pET21b and pET30a. A. niger fdc1 pET30a was transformed
into E. coli BL21(DE3) with and without ubiX pET21b and S. cerevisiae and
C. dubliniensis fdc1 pET21b were transformed into E. coli BL21(DE3) with and
without ubiX pET30a.

Mutagenesis. Mutagenesis primers were designed using the QuikChange Primer
Design Program (http://www.genomics.agilent.com/primerDesignProgram.jsp).
PCR was performed using Phusion polymerase (NEB). Template was removed
by DpnI (NEB) digestion and the PCR product transformed into E. coli NEB5α.
Once the presence of the desired mutation was confirmed by DNA sequencing, the
plasmid was co-transformed with the appropriate ubiX construct into E. coli
BL21(DE3).

Protein expression and purification. Protein was expressed in BL21(DE3) grown
at 37 °C at 180 r.p.m. in LB broth supplemented with 50 μg ml⁻¹ kanamycin and/or
50 μg ml⁻¹ ampicillin. At mid-log phase cells were induced with 0.25 mM IPTG
and supplemented with 1 mM MnCl2, grown overnight at 15 °C at 180 r.p.m. and
then collected by centrifugation (4 °C, 7,000g for 10 min). Cell pellets were
resuspended in buffer A (200 mM NaCl, 1 mM MnCl2, 50 mM Tris pH 7.5)
supplemented with DNase, RNase, lysozyme (Sigma) and Complete EDTA-free protease
inhibitor cocktail (Roche). Cells were lysed using a French press at 20,000 p.s.i. and
the lysate clarified by centrifugation at 125,000g for 90 min. The supernatant was
applied to a Ni-NTA agarose column (Qiagen). The column was washed with 3
column volumes of buffer A supplemented with 10 mM imidazole and protein eluted
in 1-ml fractions with buffer A supplemented with 250 mM imidazole. Samples
were subjected to SDS-PAGE analysis and fractions found to contain the purified
protein were pooled. Imidazole was removed using a 10-DG desalting column
(Bio-Rad) equilibrated with 100 mM NaCl, 1 mM MnCl2, 25 mM Tris pH 7.5.
Protein was aliquoted and flash frozen until required.

UV-visible spectroscopy/protein quantification. UV-visible absorbance spectra
were recorded with a Cary UV-vis spectrophotometer. The protein concentration
of A. niger Fdc1 and S. cerevisiae Fdc1 was estimated using ε280 = 68,870 M⁻¹ cm⁻¹
and ε280 = 63,830 M⁻¹ cm⁻¹ (calculated from the primary amino acid sequence
using the ProtParam program on the ExPASy proteomics server), respectively.
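
The quantification step above amounts to an application of the Beer-Lambert law, c = A / (ε · l). The sketch below is a minimal illustration using the two extinction coefficients quoted in the text; the function name, the dictionary and the default 1 cm path length are assumptions introduced here, not details from the Methods.

```python
# Molar extinction coefficients at 280 nm quoted in the text (M^-1 cm^-1).
EPSILON_280 = {
    "A. niger Fdc1": 68870.0,
    "S. cerevisiae Fdc1": 63830.0,
}


def protein_concentration_molar(a280, epsilon, path_length_cm=1.0):
    """Beer-Lambert estimate of concentration (mol/l) from absorbance.

    Hypothetical helper: the 1 cm default path length is an assumption.
    """
    if epsilon <= 0 or path_length_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return a280 / (epsilon * path_length_cm)


if __name__ == "__main__":
    conc = protein_concentration_molar(0.6887, EPSILON_280["A. niger Fdc1"])
    print(f"{conc * 1e6:.1f} uM")
```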
UV-visible decarboxylation assays. For substrates with two or more double
bonds conjugated to the acid group initial rates were determined by following
consumption of substrate by UV-visible spectroscopy using a Cary 50 Bio
spectrophotometer (Varian). Assays were performed against various concentrations of
substrate in 350 μl 50 mM KCl, 50 mM NaPi pH 6 in a 1 mm path length cuvette at
25 °C. The rate of 2,3,4,5,6-penta-fluorocinnamic acid consumption was
monitored at 260 nm, α-fluorocinnamic acid at 265 nm, cinnamic acid at 270 nm and
sorbic acid at 260 nm. Extinction coefficients for each substrate were calculated
from a range of standards and found to be ε260 = 20,000 M⁻¹ cm⁻¹ for
2,3,4,5,6-penta-fluorocinnamic acid; ε265 = 20,000 M⁻¹ cm⁻¹ for α-fluorocinnamic acid;
ε270 = 20,000 M⁻¹ cm⁻¹ for cinnamic acid; and ε260 = 24,000 M⁻¹ cm⁻¹ for sorbic
acid, the latter two values being in good agreement with the literature.
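
Converting an absorbance trace into an initial rate follows from the same Beer-Lambert relation: rate = −(dA/dt) / (ε · l). The sketch below assumes the initial, linear part of the trace has already been selected and fits it by ordinary least squares; the function names are hypothetical, and only the 1 mm (0.1 cm) path length and the extinction coefficients come from the text.

```python
def linear_slope(times_s, absorbances):
    """Ordinary least-squares slope of absorbance versus time (A per second)."""
    n = len(times_s)
    if n < 2 or n != len(absorbances):
        raise ValueError("need two or more paired data points")
    mean_t = sum(times_s) / n
    mean_a = sum(absorbances) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_s, absorbances))
    return sxy / sxx


def initial_rate_molar_per_s(times_s, absorbances, epsilon, path_length_cm=0.1):
    """Substrate consumption rate (M/s); positive when absorbance falls."""
    slope = linear_slope(times_s, absorbances)
    return -slope / (epsilon * path_length_cm)


if __name__ == "__main__":
    # Synthetic trace for illustration only: absorbance falling by 0.002 per second.
    t = [0.0, 10.0, 20.0, 30.0]
    a = [1.0 - 0.002 * x for x in t]
    print(initial_rate_molar_per_s(t, a, epsilon=20000.0))
```

Dividing such rates by the enzyme concentration would give turnover numbers; that step is left out here because the enzyme concentration used in the assays is not stated in this section.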

A. niger Fdc1 phenylpyruvate inhibition. Sodium phenylpyruvate powder was
added to the protein until there was no further change in the A380 spectrum. Any
precipitate was removed by centrifugation at 16,100g and excess phenylpyruvate in
solution removed using a 10-DG desalting column (BioRad).

A. niger Fdc1 inactivation timecourse. Enzyme was thawed and incubated on
ice. The enzyme was assayed against cinnamic acid at various time points and the
level of activity expressed as a percentage of that of freshly thawed protein (0 min).
At various time points samples were also treated with NaBH3CN powder,
centrifuged at 16,100g for 15 min and UV-visible spectra recorded.

GC headspace decarboxylation assays. A total of 500 μl 10 mM substrate
(2-hexenoic acid, 2-heptenoic acid, 2-octenoic or 2-nonenoic acid), 50 mM KCl,
50 mM NaPi pH 6 in a 2 ml GC vial was incubated with or without enzyme at
25 °C with continuous shaking. 0.5 ml head space volume was manually drawn off,
and injected into the GC with a syringe-lock needle. A Varian 3800 GC equipped
with a DB-WAX column (30 m × 0.32 mm × 0.25 μm film thickness, J&W
Scientific) was used to detect and quantify 1-alkenes produced during the enzyme
reactions. The column temperature was programmed as follows: 40 °C hold for 2
min, to 150 °C at 20 °C min⁻¹. The injector temperature was 250 °C (10:1 split),
and the FID temperature was set at 250 °C. The carrier gas was helium at a flow
rate of 1 ml min⁻¹. Product identification was achieved by comparison with pure
1-alkene standards.
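
The oven programme described above can be written as a simple piecewise function, which makes the implied run time explicit. This is a sketch with hypothetical function names; holding the oven at 150 °C once the ramp has finished is an assumption, since the text does not state a final hold.

```python
def gc_oven_temperature(t_min, start_c=40.0, hold_min=2.0,
                        ramp_c_per_min=20.0, final_c=150.0):
    """Oven temperature (deg C) at time t_min for a hold-then-ramp programme."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    if t_min <= hold_min:
        return start_c
    # Assumption: the oven stays at final_c once the ramp is complete.
    return min(final_c, start_c + ramp_c_per_min * (t_min - hold_min))


def gc_programme_duration(start_c=40.0, hold_min=2.0,
                          ramp_c_per_min=20.0, final_c=150.0):
    """Minutes from injection until the final temperature is reached."""
    return hold_min + (final_c - start_c) / ramp_c_per_min


if __name__ == "__main__":
    print(gc_programme_duration())    # hold plus ramp time, in minutes
    print(gc_oven_temperature(4.5))   # temperature part-way through the ramp
```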

Carboxylation HPLC assays. Assays containing 10 mM styrene, 100 mM NaPi
pH 6, 3 M NaHCO3 (final pH 7.5), were incubated with and without enzyme at
25 °C overnight. The sample was centrifuged at 16,100g to remove precipitate and
100 μl added to 900 μl 50% v/v H2O/acetonitrile. Sample analysis was performed
using an Agilent 1100 Series HPLC equipped with a UV detector. The stationary
phase was a Kinetex 5 μm C18 100 Å column, 250 × 4.6 mm. The mobile phase was
acetonitrile/water (50/50) with 0.1% TFA at a flow rate of 1 ml min⁻¹, and
detection was performed at a wavelength of 254 nm.

EPR. NaBH3CN powder was added to Fdc1^UbiX (215 μM). The samples were
incubated on ice and UV-visible spectra recorded until the purple colour (A550)
reached a maximum. The samples were centrifuged to remove any remaining
powder or effervescence, 300 μl of sample was transferred to an EPR tube and
frozen in liquid nitrogen. EPR spectra were obtained using a Bruker ELEXSYS
E500 EPR spectrometer equipped with a Super High Q (ER 4118-SHQ)
resonator. Temperature control was achieved using an Oxford Instruments
ESR900 cryostat connected to an ITC 503 temperature controller from the same
manufacturer.

Protein crystallization and structure determination. Crystallization was
performed by vapour diffusion. An initial screening of 0.3 μl 14 mg ml⁻¹ A. niger
Fdc1 in 100 mM NaCl, 25 mM Tris pH 7.5 and 0.3 μl reservoir solution at 4 °C
resulted in a number of hits including PACT condition F4 (Molecular Dimensions).
Seed stock produced from these crystals was used in an optimization screen based
around 0.2 M potassium thiocyanate, Bis-Tris propane pH 6.5, 20% w/v PEG 3350
mixing 0.05 μl seed stock, 0.25 μl protein solution and 0.3 μl reservoir solution at
4 °C. For S. cerevisiae Fdc1, crystals were obtained using 0.3 μl 2 mg ml⁻¹ in 20 mM
potassium phosphate, 200 mM NaCl pH 7.5 and 0.3 μl of 0.2 M calcium
acetate, 20% PEG 3000 at 4 °C. Crystals for C. dubliniensis were obtained at 4 °C using
0.3 μl 18 mg ml⁻¹ C. dubliniensis Fdc1 in 100 mM NaCl, 1 mM MnCl2, 25 mM Tris
pH 7.5 and 0.3 μl of 0.2 M potassium thiocyanate, sodium acetate pH 5.5 and
15% w/v PEG 4000.

All crystals were cryoprotected in mother liquor supplemented with 10% PEG
200 and flash cooled in liquid nitrogen. Diffraction data were collected at Diamond
beamlines at 100 K and processed using the CCP4 suite. Data were reduced and
scaled using XDS. Following molecular replacement using 41P2 as a starting
model, the structures of wild-type A. niger and S. cerevisiae Fdc1 and complexes
were refined using REFMAC5 (ref. 32), and refined by cycles of manual rebuilding
in COOT. Ligand coordinates and definitions were generated using the
GlycoBioChem PRODRG2 server (http://davapc1.bioch.dundee.ac.uk/cgi-bin/prodrg).
The data and refinement statistics are available in Extended Data Table 1.
Native mass spectrometry. Protein was buffer exchanged into 100 mM
ammonium acetate using a 10-DG desalting column (BioRad). Native mass spectrometry
experiments were carried out on a Synapt G2 instrument (Waters, Manchester,
UK) with a nanoelectrospray ionisation (nESI) source. Mass calibration was
performed by a separate infusion of NaI cluster ions. Solutions were ionised through a
positive potential applied to a platinum wire of thickness 0.125 mm (Goodfellow)
inserted into a thin-walled glass capillary (inner diameter 0.9 mm, outer diameter
1.2 mm, World Precision Instruments, Stevenage, UK) that was pulled to a nESI
tip in house with a Flaming/Brown micropipette puller (Sutter Instrument Co.,
Novato, CA). Fdc1 samples (5-10 μM) were sprayed from 100 mM ammonium
acetate pH 6.8. Capillary voltage 1.6 kV, sample cone voltage 80-90 V, extractor
cone voltage 3 V, backing pressure 5 mbar, source temperature 50 °C, trap gas
flow 0.4-5 ml min⁻¹. Data were processed using Masslynx V4.1 software and
Originlab 9.0.
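
To relate the dimer masses and charge states reported in Extended Data Fig. 5b to positions on the m/z axis, the sketch below applies the standard relation m/z = (M + z·m_p)/z for positive-mode ions. It assumes that all charges are carried by protons and ignores salt adducts; the function names are hypothetical.

```python
PROTON_MASS_DA = 1.007276  # mass of a proton in daltons


def mz_for_charge(neutral_mass_da, charge):
    """m/z of an ion carrying `charge` protons (adducts are ignored)."""
    if charge <= 0:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass_da + charge * PROTON_MASS_DA) / charge


def neutral_mass_from_mz(mz, charge):
    """Inverse of mz_for_charge: neutral mass from an observed peak."""
    if charge <= 0:
        raise ValueError("charge must be a positive integer")
    return charge * (mz - PROTON_MASS_DA)


if __name__ == "__main__":
    # Measured Fdc1 dimer mass quoted in Extended Data Fig. 5b.
    for z in range(19, 24):
        print(z, round(mz_for_charge(112265.0, z), 2))
```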
Mass spectrometry analysis of the extracted Fdc1 cofactor. Protein was buffer
exchanged into deionised H2O using a 10-DG column. 4 μl 1 M ammonium
acetate was added to 200 μl protein solution. The protein was precipitated and
cofactor released with addition of 200 μl acetonitrile and incubation at 70 °C for 5
min before centrifugation at 16,100g to remove the precipitate.

All solvents were of HPLC-MS Grade purity purchased from Sigma-Aldrich 
(Gillingham, UK). HPLC grade formic acid was purchased from Fisher Scientific 
(Loughborough, UK). Mass spectrometer calibration solution and chromato- 
graphy columns were purchased from Thermo-Fisher Scientific (Hemel 
Hempstead, UK). 

All UHPLC-MS work was carried out on a Thermo-Finnigan Orbitrap-LTQ XL
hybrid mass spectrometer operated in negative ionization mode coupled to a
Thermo Accela autosampler (Fisher Scientific, Bremen, Germany).
Chromatographic separations were adapted from work carried out by Fu and co-workers
and performed on a Thermo Hypersil Gold 2.1 μm C18 column at a solvent
flow-rate of 400 μl min⁻¹. For initial profiling tests the column was eluted with 0.1%
formic acid in water (A) and 0.1% formic acid in acetonitrile (B). The solvent
composition during gradient elution was initiated with 5% (B) for 5 min and
subsequently ramped to 95% (B) over 15 min, followed by a 5 min isocratic elution
at 95% (B) before a return to 95% (A) held for a further 5 min for column
equilibration. All samples were maintained at 4 °C within the autosampler refrigerator
while the column was maintained at 50 °C within the autosampler oven. Mass
calibration was carried out in accordance with the manufacturer’s guidelines using
caffeine (20 μg ml⁻¹), the tetrapeptide MRFA (1 μg ml⁻¹) and Ultramark 1621
(0.001%) in an aqueous solution of acetonitrile (50%), methanol (25%) and acetic
acid (1%). Acquisition settings for initial profiling were carried out at 60,000
resolution in centroid and run at 1 μ-scan per 400 ms in the 100-1,000 m/z range
with source gasses set at sheath gas = 40 arbitrary units, aux gas = 5 arbitrary
units, sweep gas = 5 arbitrary units. The ESI source voltage was set to 4.2 V, and
capillary ion transfer tube temperature set at 275 °C.

Mass fragmentation analysis was carried out with the same column chemistry,
source settings and flow rate but with an isocratic solvent elution of 55% (A)/45%
(B), an optimum composition determined for eluting the analyte of interest, based
on the initial elution profile described above. Mass spectrometry source, sample
storage and column were kept under identical conditions. Collision induced
dissociation (CID) settings were set up to trap ions of 523 m/z with an isolation width of
1.0 m/z, normalized collision energy of 35, activation Q of 0.250 and activation
time of 30 ms.

DFT calculations. Density functional theory (DFT) models were optimized in the
gas phase using the B3LYP/6-311++G(d,p) or BH&H/6-311++G(d,p) level of
theory implemented in Gaussian 09 (ref. 35). Models are shown in Extended
Data Fig. 7 and structural alignments to the crystal coordinates were performed
using Swiss-PdbViewer version 4.1 (ref. 36). Harmonic vibrational frequencies
calculated using normal mode analysis were used to confirm that optimized
geometries were always in local or global minima, and to calculate relative free
energies.

Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment.

Extended Data Figure 1 | HPLC chromatogram demonstrating enzymatic carboxylation of styrene by A. niger Fdc1^UbiX. Chromatogram of a 10 mM styrene
and saturating NaHCO3 solution incubated in the absence (blue) and presence (red) of the A. niger Fdc1^UbiX enzyme.

Extended Data Figure 2 | S. cerevisiae Fdc1^UbiX solution data. The top panel is a direct comparison to main Fig. 1b. In this case, the non-UbiX coexpressed Fdc1
binds FMN weakly. The bottom panel shows S. cerevisiae Fdc1^UbiX steady-state kinetic data obtained with cinnamic acid. Error bars are s.e.m., n = 3.

Extended Data Figure 3 | A detailed view of the S. cerevisiae and C. dubliniensis Fdc1^UbiX active site. The omit map corresponding to the prFMN
cofactor is shown in a blue mesh contoured at 3σ. Atoms derived from dimethylallyl-monophosphate are shown in green. Both Fdc1^UbiX crystals
appeared colourless as observed for A. niger Fdc1^UbiX, indicating negligible FMN binding.
Panel labels: S. cerevisiae Fdc1, 1.65 Å; C. dubliniensis Fdc1, 2.5 Å. Residue labels: E285, E240.

©2015 Macmillan Publishers Limited. All rights reserved 


[Extended Data Figure 4 plot residue: traces A and B; x axes, Magnetic Field (G), spanning 2700–3900 G and 3280–3400 G]

Extended Data Figure 4 | EPR spectroscopy confirms Mn²⁺ binding. Left
panel shows X-band continuous wave frozen solution EPR spectra of (A)
Fdc1^UbiX and (B) Mn(H₂O)₆²⁺ showing the characteristic six line pattern
arising from the m_s = ±1/2 spin manifold of the S = 5/2 Mn²⁺ ion. The six line
pattern reflects hyperfine coupling to the I = 5/2 ⁵⁵Mn nucleus and is sensitive
to the environment of the ion, thus the differences between A and B indicate
binding of Mn²⁺ to the enzyme. Experimental conditions: microwave power
0.5 mW, modulation amplitude 7 G, temperature 20 K. The right panel shows
X-band continuous wave frozen solution EPR spectra of (A) Fdc1^UbiX reduced
using sodium cyanoborohydride (NaBH3CN) and subsequently exposed to air
and (B) wild-type UbiX + DMAP reduced with dithionite and reoxidised with
oxygen. Experimental conditions: microwave power 10 μW, modulation
amplitude 1.5 G, temperature 20 K. Reduction using NaBH3CN of the modified
cofactor formed in Fdc1^UbiX (prFMN^iminium) gives rise to a radical on air
oxidation with the same g value and line width as that formed on the
modified flavin (prFMN) in UbiX. However, the lack of the distinctive fine
structure in A that is normally observed for the UbiX radical B, suggests
heterogeneity in the Fdc1^UbiX radical or possibly a magnetic interaction with
the nearby Mn²⁺ ion in Fdc1.

Extended Data Figure 5 | Mass spectrometry of the Fdc1^UbiX cofactor.
a, (i) Structural elucidation of the Fdc1^UbiX cofactor. Full scan total ion current
(TIC) created under a gradient elution using H2O/acetonitrile both containing
0.1% formic acid indicating a major peak apex at 9.53 min with a 52/48 solvent
composition. Also shown is the proposed structure of 523 m/z. Mass spectrum
taken at major peak apex in (ii) (9.53 min) indicating an associated full-scan
molecular ion peak with m/z = 523.1589 (M⁺ = C22H28N4O9P) at a resolution
of 58,501 with a mass accuracy of 0.06 p.p.m. Also eluting alongside the target
molecule are two isotopic variants containing ¹³C and ¹³C₂. The ¹³C peak is
displaying a relative abundance of 22 to the 523 m/z peak that is in line with the
number of carbon atoms contained within the structure. Fragmentation of the
523.16 m/z molecular ion peak (iii) in an automated data dependent manner
using helium-based collision induced dissociation (CID level 35) generated a
spectral tree that indicates the removal of the phosphate head group at the MS²
level generating 425.18 m/z. Subsequent MS³ level activation (CID 35) on
425.18 m/z partially (A) or completely (B) removing the tail group from
the newly formed four-ring system generating 335.13 m/z or 309.09 m/z,
respectively. The presence of a hydrolysed species with 541 m/z is also reported
(iv). b, Native mass spectra of Fdc1 (top) and Fdc1^UbiX (bottom). Fdc1 presents
in charge states 19+ to 23+, while Fdc1^UbiX presents in charge states 19+ to
22+. Right-hand spectrum; an enlarged view of the 21+ charge state. The
predicted masses are shown by blue dashed lines. The spectrum of Fdc1^UbiX
shows that approximately two-thirds of the ions have two non-covalently
bound cofactors, approximately one-third have one non-covalently bound
cofactor and there is a small amount with no cofactor bound. Fdc1 contains no
cofactor. The measured mass of the Fdc1 dimer is 112,265 Da (predicted
mass from sequence is 112,270 Da). The measured mass for the apo form of
Fdc1^UbiX is 112,345 Da, slightly higher than for Fdc1 which is attributed to an
increased retention of salt. The mass difference of +80 Da corresponds to the
mass of two potassium adducts. For the Fdc1^UbiX species with one bound
cofactor, the measured mass (112,968 Da) is 178 Da higher than predicted. The
predicted mass corresponds to the left hand side of the peak, which is the
protein plus cofactor with no extra salt retained. The extra mass could be
attributed to 2 Mn²⁺ ions and 2 K⁺ ions. The Fdc1^UbiX bound to two cofactors
has a measured mass of 113,583 Da, which is 268 Da larger than expected.
Again, however, the predicted mass corresponds to the left-hand side of the
peak. These spectra indicate that the protein dimer carries either 1 or 2
cofactors of 525 Da, along with a variety of other salt adducts. The extent of
adductation is higher for Fdc1^UbiX and increases with bound cofactors,
indicating that the addition of the cofactor recruits counter ions.

Extended Data Figure 6 | A proposed mechanism for cofactor maturation
in Fdc1^UbiX through oxidation. We propose the prFMN^reduced cofactor
produced by UbiX is bound by apo-Fdc1 and oxidized in a stepwise manner.
Although the initial radical species resembles that observed during
non-physiological oxidation of the prFMN^reduced–UbiX product complex, we
propose that in the Fdc1 enzyme proton abstraction from the prFMN C1′
leads to a distinct radical species. The latter can either be oxidized further to the
corresponding prFMN^iminium or, via a radical based isomerization process, to
form the central seven-membered ring ultimately leading to prFMN^ketimine.

Extended Data Figure 7 | DFT models of the proposed prFMN^iminium and
prFMN^ketimine isomers and their substrate adducts. a, Chemical structures of
the prFMN^iminium and prFMN^ketimine models. DFT models (pink carbons) are
overlaid with crystal coordinates (green carbons) of prFMN^iminium (b) and the
two butterfly bent conformations of prFMN^ketimine (c, d). c, d, The
substrate-free (bent ‘up’) prFMN^ketimine is shown in c and the more planar
substrate-bound (bent ‘down’) in d. For comparison of the extent of the butterfly bending
of prFMN^ketimine, the two DFT models in c and d are aligned over the four-ring
nitrogen atoms and overlaid in e. f, Chemical structures of the proposed initial
prFMN^iminium and prFMN^ketimine substrate adducts with the cinnamic acid
substrate highlighted in red. The DFT-optimized structures of these species are
overlaid in g with the prFMN^iminium species shown with pink carbons and the
prFMN^ketimine species with teal carbons. Three projections of overlaid DFT
models of the prFMN^iminium species with substrate bound (pink carbons;
reproduced from g), after substrate decarboxylation (teal carbons; substrate
double bond is cis) and upon protonation of the substrate β-carbon (green
carbons) are shown in h. Note that the substrate carboxylate was artificially
protonated in these models to maintain charge neutrality. Models were
geometry optimized in the gas phase using either the B3LYP/6-311++G(d,p)
(panels a-e) or BH&H/6-311++G(d,p) (panels f-h) level of theory. BH&H
was chosen over B3LYP for the substrate adducts as BH&H has been shown to
better describe π-stacking interactions, which are likely to occur between the
modified isoalloxazine and substrate phenyl moieties. Harmonic vibrational
frequencies calculated using normal mode analysis were used to confirm that
optimized geometries of all species were in local or global minima. Absolute
energies and relative free energies of the substrate-free species, determined
from the normal mode calculations, are given in the table, top right. Cartesian
coordinates of the optimized structures are given in the Supplementary
Information. 
Extended Data Figure 8 | Fdc1 decarboxylase activity with simple acrylic
acids. Gas chromatogram showing products formed from a solution of
respectively 10 mM 2-hexenoic (1-pentene, blue; enzyme free control in black),
10 mM 2-heptenoic (1-hexene, green; enzyme free control in black) or 10 mM
2-octenoic acid (1-heptene, red; enzyme free control in black). Identification of
1-alkenes was based on known standards.
Extended Data Figure 9 | UV-visible spectral properties of Fdc1 variants.
UV-visible spectra of Fdc1 (614 µM), Fdc1^UbiX (492 µM), Fdc1 R173A^UbiX (749 µM)
and Fdc1 E282Q^UbiX (171 µM) normalised on the A₂₈₀ peak. Inset, close up of the
additional spectral features present in the 300-500 nm region.
Extended Data Table 1 | Data collection and refinement statistics

Structures (columns 1-9):
1, A. niger Fdc1 (only prFMN^iminium present)
2, A. niger Fdc1 (prFMN^iminium + prFMN^ketimine)
3, A. niger Fdc1 + alpha-methyl cinnamic acid
4, A. niger Fdc1 + penta-fluoro cinnamic acid
5, A. niger Fdc1 + alpha-fluoro cinnamic acid
6, A. niger Fdc1 + phenyl pyruvate
7, A. niger Fdc1 + 4-vinyl guaiacol
8, S. cerevisiae Fdc1
9, C. dubliniensis Fdc1

Structure               1              2              3              4              5              6              7              8              9
PDB code                4ZA4           4ZA5           4ZA7           4ZA8           4ZAB           4ZA9           4ZAA           4ZAC           4ZAD
Data collection
Space group             P2₁2₁2         P2₁2₁2         P2₁2₁2         P2₁2₁2         P2₁2₁2         P2₁2₁2         P2₁2₁2         P2₁            P2₁
a (Å)                   95.99          96.02          95.93          95.91          95.91          96.05          96.59          114.41         91.97
b (Å)                   63.93          63.79          64.16          64.20          64.20          63.85          64.47          96.18          64.55
c (Å)                   87.63          87.72          87.73          87.79          87.79          87.68          87.89          116.64         96.05
β (°)                   -              -              -              -              -              -              -              96.58          91.07
Resolution (Å)          33.82-1.22     48.01-1.10     87.73-1.1      64.81-1.06     32.38-1.16     63.85-1.01     94.19-1.242    54.19-1.65     67.04-2.46
                        (1.25-1.22)    (1.13-1.10)    (1.13-1.10)    (1.09-1.06)    (1.19-1.16)    (1.04-1.01)    (1.274-1.242)  (1.69-1.65)    (2.52-2.46)
Rpim                    3.7 (35.4)     3.2 (37.7)     3.5 (30.9)     3.9 (33.9)     3.8 (36.1)     2.7 (31.4)     3.1 (34.4)     3.9 (39.2)     9.1 (48.0)
I/σI                    11.2 (2.0)     12.6 (2.1)     11.20 (2.4)    12.8 (2.5)     12.0 (2.3)     14.3 (2.2)     11.5 (2.0)     11.0 (2.0)     6.7 (1.7)
Completeness (%)        99.96 (99.99)  99.97 (99.99)  98.0 (97.8)    99.96 (99.98)  99.9 (99.9)    99.97 (99.99)  98.9 (94.4)    98.8 (99.7)    99.8 (99.8)
Redundancy              6.1 (5.8)      5.5 (3.6)      4.2 (2.7)      5.2 (3.6)      4.2 (2.8)      4.0 (2.2)      6.5 (6.0)      3.7 (3.8)      3.4 (3.4)
Refinement
Resolution (Å)          33.82-1.22     48.01-1.10     30.05-1.10     64.81-1.06     32.38-1.16     63.85-1.01     94.19-1.25     54.19-1.65     67.04-2.46
                        (1.25-1.22)    (1.13-1.10)    (1.13-1.10)    (1.09-1.06)    (1.19-1.16)    (1.04-1.01)    (1.274-1.242)  (1.69-1.65)    (2.52-2.46)
No. reflections         152260         206025         203456         231449         175535         264342         146799         282233         39186
                        (8041)         (10689)        (10579)        (12068)        (9183)         (13893)        (7752)         (15010)        (1956)
Rwork/Rfree             14.5/16.1      14.5/15.6      13.2/14.1      14.79/16.61    14.6/15.4      12.99/14.50    14.34/16.32    16.44/19.23    20.66/25.5
                        (23.6/26.4)    (25.9/26.9)    (20.7/21.2)    (23.1/26.9)    (23.0/23.5)    (24.6/26.0)    (25.4/30.0)    (31.6/34.8)    (35.2/44.2)
No. atoms               4595           4692           4649           4665           4694           4554           4661           17693          8213
B-factors               9.81           10.71          10.25          10.321         9.83           9.38           11.07          24.8           26.1
R.m.s. deviations
Bond lengths (Å)        0.028          0.028          0.028          0.029          0.031          0.029          0.026          0.032          0.017
Bond angles (°)         2.384          2.485          2.469          2.429          2.622          2.250          2.304          2.467          1.726
UbiX is a flavin prenyltransferase required for
bacterial ubiquinone biosynthesis 


Mark D. White, Karl A. P. Payne, Karl Fisher, Stephen A. Marshall, David Parker, Nicholas J. W. Rattray, Drupad K. Trivedi,
Royston Goodacre, Stephen E. J. Rigby, Nigel S. Scrutton, Sam Hay & David Leys


Ubiquinone (also known as coenzyme Q) is a ubiquitous lipid-soluble
redox cofactor that is an essential component of electron
transfer chains. Eleven genes have been implicated in bacterial
ubiquinone biosynthesis, including ubiX and ubiD, which are
responsible for decarboxylation of the 3-octaprenyl-4-hydroxybenzoate
precursor. Despite structural and biochemical characterization
of UbiX as a flavin mononucleotide (FMN)-binding
protein, no decarboxylase activity has been detected. Here we
report that UbiX produces a novel flavin-derived cofactor required
for the decarboxylase activity of UbiD. UbiX acts as a flavin
prenyltransferase, linking a dimethylallyl moiety to the flavin N5 and
C6 atoms. This adds a fourth non-aromatic ring to the flavin
isoalloxazine group. In contrast to other prenyltransferases, UbiX is
metal-independent and requires dimethylallyl-monophosphate as
substrate. Kinetic crystallography reveals that the prenyltransferase
mechanism of UbiX resembles that of the terpene synthases.
The active site environment is dominated by π systems, which assist
phosphate-C1' bond breakage following FMN reduction, leading
to formation of the N5-C1' bond. UbiX then acts as a chaperone
for adduct reorientation, via transient carbocation species, leading
ultimately to formation of the dimethylallyl C3'-C6 bond. Our
findings establish the mechanism for formation of a new
flavin-derived cofactor, extending both flavin and terpenoid biochemical
repertoires.

Flavin is a common cofactor responsible for highly versatile (redox)
chemistry, its properties modified or fine-tuned by the protein
scaffold, occasionally through covalent attachment to the protein matrix
via the C6 or C8 atoms of the isoalloxazine dimethylbenzene ring. We
have shown that the reversible decarboxylases belonging to the UbiD
or the homologous Fdc enzyme family require a heavily modified
FMN cofactor. In these enzymes, the FMN is linked to a C₅-alkyl
group via N5-C1' and C6-C3' linkages, adding a fourth
(non-aromatic) ring to the isoalloxazine ring system. Knockout of
ubiD/fdc1 or the associated ubiX (or the homologous pad1) genes leads to
similar phenotypes, and the latter have been shown to encode
FMN-binding proteins for which no decarboxylase activity has been
detected in vitro. Recently it was found that UbiX/Pad1 is responsible
for activation of Saccharomyces cerevisiae Fdc1, suggesting that
UbiX/Pad1 is responsible for the synthesis of the UbiD/Fdc1 cofactor.
The observed FMN modification by a C₅-unit suggests this occurs
through prenyltransferase activity (Fig. 1a).

We tested this hypothesis by incubating Pseudomonas aeruginosa
UbiX with oxidized FMN and the universal isoprene precursors
dimethylallyl-pyrophosphate (DMAPP) or isopentenyl-pyrophosphate
(IPP). However, spectroscopic evidence for formation of a ternary
isoprene precursor-FMN-UbiX complex or of covalent FMN
modification could not be obtained. By contrast, perturbation in the FMN
ultraviolet (UV)-visible spectrum was observed in the presence of
dimethylallyl-monophosphate (DMAP), leading to an apparent Kd
of 12.0 ± 0.4 µM (Fig. 1b). Although the minor spectral perturbation
is indicative of binding close to the FMN isoalloxazine, it does
not reflect FMN modification. However, reduction of the ternary
FMN-DMAP-UbiX complex using sodium dithionite followed by
re-oxidation under aerobic conditions leads to formation of a stable
purple-coloured intermediate (Fig. 1c). Ultra-high-performance
liquid chromatography coupled with high-resolution mass
spectrometry of extracts from both the reduced and re-oxidized ternary
FMN-DMAP-UbiX complex reveals a molecular species with
mass corresponding to the reduced (prenylated-FMN^reduced or
prFMN^reduced) or radical (prFMN^radical) form of the UbiD/Fdc1
cofactor, respectively (Extended Data Fig. 1), and electron
paramagnetic resonance (EPR) spectroscopy confirms the formation of a
radical in ~95% yield (relative to FMN) in the re-oxidized ternary
complex (Fig. 1d). Furthermore, the radical species has EPR
properties consistent with those of an N5, C6 alkylated flavin semiquinone
(Extended Data Figs 2-4). Activity of Aspergillus niger apo-Fdc1 could
be reconstituted in vitro following anaerobic incubation with
prFMN^reduced-UbiX. Decarboxylase activity could only be observed
after exposure of the reconstituted Fdc1 to oxygen (Fig. 1e), but
appeared independent of the presence of UbiX (Extended Data
Fig. 5a). No activity could be observed when incubating apo-Fdc1
with the prFMN^radical-UbiX. This suggests only the prFMN^reduced
form can be correctly oxidized by UbiD/Fdc1 to the corresponding
reactive N5-iminium species (prFMN^iminium).

Stopped-flow experiments mixing reduced FMNH₂-UbiX with
DMAP under anaerobic conditions reveal transient formation of a
distinct spectral species (species 2 in Fig. 1f) before formation of
the prFMN^reduced product. The rate of species 2 formation showed
a hyperbolic dependence on DMAP concentration with apparent
Kd = 730 ± 70 µM and a limiting rate of 177 ± 7 s⁻¹
(Extended Data Fig. 5b, c). The decay was independent of DMAP
concentration and occurred with a rate of 0.316 ± 0.002 s⁻¹. Stopped-flow
experiments mixing reduced prFMN^reduced-UbiX with oxygen
reveal that the rate of formation of the purple-coloured end product is
linearly dependent on oxygen concentration (Extended Data Fig. 5d, e).
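
The hyperbolic dependence described above can be illustrated with a short calculation. The sketch below assumes the usual saturation form, k_obs = k_lim [DMAP] / (Kd + [DMAP]); the function name and the chosen concentrations are illustrative and are not taken from the original analysis.

```python
def observed_rate(substrate_uM, k_lim, kd_uM):
    """Observed rate (s^-1) for a hyperbolic (saturating) dependence on substrate.

    Assumed form: k_obs = k_lim * [S] / (Kd + [S]).
    """
    if substrate_uM < 0 or kd_uM <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return k_lim * substrate_uM / (kd_uM + substrate_uM)


# Values reported in the text for formation of species 2.
K_LIM = 177.0   # s^-1, limiting rate
KD_UM = 730.0   # uM, apparent Kd

# Illustrative DMAP concentrations after 1:1 mixing (hypothetical choices).
for dmap_uM in (250.0, 500.0, 1000.0, 2000.0):
    rate = observed_rate(dmap_uM, K_LIM, KD_UM)
    print(f"{dmap_uM:7.0f} uM DMAP -> k_obs = {rate:6.1f} s^-1")
```

At a DMAP concentration equal to the apparent Kd, the observed rate is half the limiting rate, which is the defining property of this form.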

We obtained the 1.8 Å crystal structure of P. aeruginosa UbiX in
complex with oxidized FMN and DMAP (Fig. 2a, b). The DMAP
substrate is located directly above the FMN isoalloxazine re face, with
the dimethylallyl moiety sandwiched between the FMN and A89-S90.
The dimethylallyl group is furthermore surrounded by aromatic
ring systems of W84, Y169 and W200 that, together with the FMN
dimethylbenzene moiety, resemble the π-cage found in other
prenyltransferases or terpene synthases. The phosphate moiety is bound by
several cationic residues, including conserved residues R122, K129,
R139 and R185 in addition to polar interactions with S90, the
backbone amide of G91, E140 and Y169 (Extended Data Fig. 6). The short
N5 to substrate C1' distance (3.3 Å) and relatively small N10-N5-C1'
angle of 98° are compatible with those observed for other
flavin-substrate complexes. No evidence of direct nucleophilic/oxidative
attack of the oxidized FMN N5 was observed. Substrate complex
crystals that are transiently reduced with sodium dithionite under
aerobic conditions turned purple upon oxidation, closely resembling
solution properties. The 1.6 Å crystal structure of a purple-coloured
crystal reveals formation of the four-ring prFMN^radical (Fig. 2d).
Although product formation has little effect on the active site structure,
the presence of the fourth ring is accompanied by a slight rotation of
the FMN at the N3 atom. This accommodates the extended
isoalloxazine ring system while keeping the hydrophilic network involved in
binding the isoalloxazine ring intact. These observations establish that
UbiX crystals are catalytically competent, but do not reveal the order
in which the N5-C1' and C6-C3' linkages are made.

We used kinetic crystallography to provide detailed insights into the
UbiX mechanism, taking advantage of the fact that a relatively
long-lived intermediate can be observed in solution studies (Fig. 1f).
Reduction of the enzyme-substrate complex by sodium dithionite
followed by rapid freezing and structure determination to 1.4 Å
reveals that formation of an unusual N5-C1' dimethylallyl adduct
occurs within 1-5 s, establishing that N5-C1' bond formation occurs
first (Fig. 2c). The N5-C1' adduct adopts a significant butterfly
conformation, and the N5 is clearly sp³ hybridized. Although S15 and E49
are located in proximity of the N5, a (transient) reorientation of both
side chains is required to establish a hydrogen-bonding network with
the N5. The 1.9 Å structure of substrate complex crystals frozen 20-30 s
following reduction reveals formation of the prFMN^reduced product
(Extended Data Fig. 7). Longer incubation times lead to formation of
the prFMN radical species as indicated by a gradual purple colouration
of crystals. No other intermediates were seen to accumulate within the
wild-type crystals, in agreement with our stopped-flow solution data
(Fig. 1f).

Figure 1 | P. aeruginosa UbiX solution studies. a, Schematic overview
of the proposed UbiX reaction. The N5-C6 prenylated FMNH₂ product
(prFMN^reduced) undergoes (probably non-physiological) oxidation to a radical
species (prFMN^radical) in the presence of oxygen (see d). In the presence of
apo-UbiD or apo-Fdc1, we propose that the UbiX product is oxidized to the
UbiD/Fdc1 prFMN^iminium cofactor (see e). b, Titration of oxidized FMN-UbiX
with DMAP. Grey lines represent individual scans at increasing DMAP
concentrations with the black line representing the final spectrum obtained at
saturation. The binding curve is derived from the global absorbance change in
the 310-450 nm range. c, UV-visible spectra of UbiX-FMN obtained during
redox cycling in presence of DMAP. d, EPR spectrum of the
UbiX-prFMN^radical complex. e, Reconstitution of A. niger apo-Fdc1 decarboxylase
activity by incubation with wild-type and variant UbiX enzymes in presence of
DMAP and FMNH₂ followed by oxygen exposure. No activity can be observed
under anaerobic conditions. f, Singular value decomposition of rapid-scan
stopped-flow spectrophotometric data following mixing of wild-type
UbiX-FMNH₂ with DMAP. The spectral species identified can tentatively be assigned
to (1) ternary UbiX-FMNH₂-DMAP complex; (2) an intermediate covalent
adduct formed between FMNH₂ and DMAP and (3) the UbiX-prFMN^reduced
product.


In order to determine whether S15 and/or E49 are involved in N5
deprotonation and/or formation of the C6-C3' linkage, we created an
E49Q variant. The mutation severely affects but does not abolish in vitro
Fdc1 activation (Fig. 1e). Although the oxidized substrate E49Q complex
is similar in structure to the corresponding wild-type structure (Fig. 3a),
a distinct intermediate species was seen following reduction and rapid
freezing within 1-5 s (Fig. 3b). Although the latter structure clearly
contains a reduced FMNH₂, no N5-C1' bond has formed, in contrast
to the corresponding wild-type intermediate structure obtained within
1-5 s following reduction (Fig. 2c). Furthermore, both S15 and Q49
establish a hydrogen-bonding network with the N5. This suggests N5
deprotonation through S15 and E49 is linked to N5-C1' bond
formation, a process largely rendered ineffective through the E49Q mutation
(Fig. 4, species I^reduced). Structures of the E49Q variant derived for
crystals frozen 0.5-10 min after reduction reveal that accumulation of
an N5-C1' dimethylallyl adduct does occur (Fig. 3c), albeit distinct in
conformation from that observed in the 1-5 s wild-type structure
(Fig. 2c).

In the E49Q N5-C1' dimethylallyl adduct, the dimethylallyl
substrate-derived moiety has undergone a 180° rotation when compared
to the wild-type N5-adduct species, along with a small rotation of
the isoalloxazine ring (Figs 3c and 4, species IVa). Furthermore,
both S15 and Q49 form a hydrogen-bonding network with N5, with
the S15-N5 hydrogen bond maintaining the N5 in an sp³ state. This
series of conformational changes achieves two objectives: a proton
relay network is established between the N5 and solvent via S15
and E49, and additional space is created between the isoalloxazine
dimethylbenzene moiety and Y169 to allow for formation of the
fourth ring.
Figure 2 | Crystal structures of P. aeruginosa UbiX. a, Overview of the
dodecameric UbiX structure, with 3 individual monomers (coloured in cyan,
blue and purple, respectively) contributing to a single active site shown in
cartoon representation while the remainder (in grey) is shown in surface
representation. The bound FMN and DMAP substrates are shown in
atom-coloured spheres. b, Detailed view of the oxidized UbiX-FMN-DMAP
complex in two orientations related by a 90 degree rotation along the horizontal
axis. Individual amino acids contributing to the active site structure are shown
in atom-coloured sticks (carbons colour coded as in Fig. 2a). Residues
positioned in front of the FMN-DMAP substrates are shown as thin lines for
clarity. The omit map for the DMAP substrate is shown as a green mesh,
contoured at 4σ. Hydrogen-bonding networks established with the phosphate
moiety are shown by dotted lines. c, Detailed view of the N5-C1' alkyl adduct
species (in two orientations related by a 90 degree rotation along the horizontal
axis) obtained through rapid flash-cooling following reduction. The omit map
for the N5-C1' adduct is shown as a green mesh, contoured at 4σ. d, Detailed
view of the UbiX-product complex in two orientations related by a 90 degree
rotation along the horizontal axis. The omit map for the product is shown as
green mesh, contoured at 3.5σ.
Figure 3 | Crystal structures of P. aeruginosa UbiX^E49Q. a, Detailed view
of the UbiX^E49Q-FMN-DMAP complex with individual amino acids
contributing to the active site structure shown in atom-coloured sticks
(carbons colour coded as in Fig. 2a). Two orientations are displayed as in Fig. 2.
The omit map for the DMAP substrate is shown as green mesh, contoured at
4σ. b, Detailed view of the UbiX^E49Q-FMNH₂-DMAP complex obtained
through rapid flash-cooling following reduction. The omit map for the
DMAP substrate is shown as green mesh, contoured at 3σ. c, Detailed view of
the UbiX^E49Q N5-C1' alkyl adduct species obtained through flash-cooling
following reduction. The omit map for the N5-C1' adduct is shown as green
mesh, contoured at 4σ.


Many of the conformational changes observed in the E49Q
N5-dimethylallyl adduct are unlikely to present a significant barrier to
the reaction even within the crystals. However, the observed
reorientation of the dimethylallyl moiety would require substantial protein
breathing motions to occur if achieved through rotation along the
dimethylallyl C1'-C2' bond. An alternative route through olefin
isomerization is possible, via protonation of the C2' atom (adding a pro-S
hydrogen) with transient formation of a tertiary C3' carbocation
(Fig. 4, species III). The latter could reorient in the absence of active
site deformation and be converted to the observed E49Q
dimethylallyl-N5 adduct (Fig. 4, species IVa) by abstraction of the C2' pro-R
proton. The substrate-derived phosphate is ideally positioned to
establish a proton relay network to the substrate C2', and the tertiary C3'
carbocation resides within a π-cage made by the FMN
dimethylbenzene moiety, Y169 and W200 that could assist carbocation formation.
Figure 4 | Schematic representation of the proposed UbiX mechanism.
DMAP derived atoms are shown in red. Roman numerals indicate the various
intermediate species proposed (see main text). References to individual figures
after roman numerals refer to corresponding crystal structures obtained for the
wild type or UbiX^E49Q mutant.

Surprisingly, despite little difference in structure, crystals of a Y169F
variant are severely compromised in their catalytic ability. A similar
observation is made when using the UbiX^Y169F variant for in vitro
reconstitution of Fdc1 activity (Fig. 1e). In the UbiX^Y169F crystals, an
N5-C1' dimethylallyl adduct nearly identical to that seen for the
wild-type crystals can be readily obtained, but remains stable for several
minutes (Extended Data Fig. 8). This suggests Y169 is key to the
conformational changes that follow formation of the first N5 adduct
(Fig. 4, species III), possibly through assisting proton transfer via the
substrate phosphate to the dimethylallyl C2' or stabilization of the
tertiary C3' carbocation.

We have been unable to trap any intermediates during formation of
the dimethylallyl C3'-flavin C6 bond. We suggest C6 nucleophilic
attack on the dimethylallyl C3' carbocation occurs concomitant with
or following protonation of the C2' via the bound phosphate (Fig. 4,
species IVb). The resulting cyclohexadiene adduct (species V) can
form the final product (species VI) through aromatization
concomitant with proton abstraction via S15 and E49 (Extended Data Fig. 9).
We suggest the E49Q mutation also affects this particular
deprotonation, leading to the accumulation of an intermediate (species IVa)
preceding this step as observed in the crystals. It is possible that
formation of the cyclohexadiene adduct (V) occurs directly following
conformational change of the C3' carbocation species (III), via species
IVb without intermediate deprotonation/protonation as outlined for
species IVa.

In addition to UbiX, a distinct family of flavoenzymes (type 2
isopentenyl diphosphate isomerases) bind isoprene precursors. The
latter have been suggested to use the flavin as an acid-base in the
interconversion of IPP and DMAPP. Instead, our data reveal
UbiX has many similarities to terpene synthases, achieving the required
isoprenyl chemistry via similar strategies (Fig. 4). We suggest
formation of an initial allylic carbocation through leaving of the
phosphate group (pyrophosphate in the terpene synthases) is achieved
through stabilization of the carbocation species within an active site
dominated by π-systems (the π-cage). Furthermore, FMN reduction
appears to act as the trigger for phosphate-C1' bond breakage,
increasing the electron density of the isoalloxazine ring that stacks with the
dimethylallyl substrate. In contrast to the metal-requiring terpene
synthases, the leaving group is stabilized by a multitude of ionic
interactions. A direct contact with the conserved E140 furthermore suggests
phosphate protonation occurs concomitant with C1' allylic
carbocation formation and/or N5 nucleophilic attack (Fig. 4, species I^reduced).
Following formation of the N5-C1' bond, UbiX appears to act as a
chaperone for substrate reorientation (conversion of species III to
IVb), similar to what is postulated to occur for terpene synthases. In
UbiX, it appears both the leaving group phosphate and a conserved
tyrosine residue (Y169) are involved in catalysing the conformational
change of the N5 adduct required for completion of the reaction.

The presence of ubiX/ubiD-like genes in the majority of
microbes and the essential role played in prokaryotic ubiquinone
biosynthesis suggest an ancient evolutionary origin for the UbiX
flavin prenyltransferase chemistry. It remains unclear why
dimethylallyl-monophosphate is used by UbiX, as opposed to the universal
dimethylallyl-diphosphate. This might serve as a means to regulate product
formation via availability of DMAP. Although the isomer isopentenyl
monophosphate has recently been shown to occur in certain archaea,
the metabolic route to dimethylallyl-monophosphate remains unclear.
The unusual biochemical strategy of using a reduced rather than
oxidized flavin to act as a nucleophile ensures formation of a
relatively stable N5-alkyl adduct species (prFMN^reduced), as opposed to the
corresponding N5-iminium adduct (prFMN^iminium). The latter would
be prone to hydrolysis before transfer to apo-UbiD/Fdc1. Whether
non-ubiD related enzymes make use of the novel chemical properties
of the UbiX product remains to be established.
Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Cloning of P. aeruginosa UbiX for heterologous expression in E. coli. The ubiX
gene of Pseudomonas aeruginosa was amplified from synthesized DNA (codon
optimized for expression in Escherichia coli, GenScript) by PCR using Phusion
polymerase (NEB). The PCR products were cloned into the NcoI and HindIII
restriction sites of the pNIC28-Bsa4 vector using ligation-independent methods
(In-Fusion HD, Clontech). This generated a construct with an N-terminal
polyhistidine tag and TEV protease cleavage site similar to that used in ref. 12. The
In-Fusion product was transformed into E. coli NEB5α competent cells (NEB) to
allow verification of the expression construct by sequencing (Eurofins). Validated
pNIC28-Bsa4-ubiX was transformed into E. coli BL21(DE3) competent cells
(NEB) for protein production.

Cloning of A. niger Fdc1 for heterologous expression in E. coli. The A. niger
fdc1 gene was codon optimized to remove codons that were rare in E. coli and
synthesized (GenScript) in pUC57. The gene was excised from pUC57 using NdeI
and XhoI, cloned into the NdeI and XhoI sites of pET30a using T4 ligase (NEB)
and transformed into E. coli NEB5α. Once the sequence of the insert was
confirmed, A. niger Fdc1 pET30a was transformed into E. coli BL21(DE3).

Mutagenesis of P. aeruginosa UbiX for heterologous expression in E. coli.
Mutagenesis primers were designed using the QuikChange Primer Design
Program (http://www.genomics.agilent.com/primerDesignProgram.jsp) and used
in conjunction with Phusion-polymerase-mediated PCR. Template DNA was
removed by DpnI (NEB) digest and the PCR products were transformed into
E. coli NEB5α competent cells to allow mutagenesis to be verified by sequencing.
Constructs possessing validated mutations were transformed into E. coli
BL21(DE3) competent cells for protein production.

Heterologous expression of P. aeruginosa UbiX (wild-type and variants) and
A. niger Fdc1. E. coli transformants were grown in 1 l of Luria-Bertani (LB) broth
at 37 °C and 180 r.p.m. until an absorbance at 600 nm (OD₆₀₀) of 0.6-0.8 was reached. Cultures
were induced with 0.3 mM IPTG and grown overnight at 20 °C and 180 r.p.m.
Cells were collected by centrifugation at 7,000g for 10 min.

Purification of P. aeruginosa UbiX (wild-type and variants). Cell pellets were 
resuspended in 50 mM Tris, 500 mM NaCl pH 8.0 buffer supplemented with 
lysozyme, RNase, DNase (Sigma) and a Complete EDTA free protease inhibitor 
cocktail (Roche). The cells were lysed using a French Press cell disrupter at 
20,000 p.s.i. followed by centrifugation at 48,000g for 30 min and filtration through 
a 0.45 micron membrane. The soluble supernatant was loaded on to equilibrated 
Ni-NTA agarose resin (Qiagen) and washed successively with 4 column volumes 
of resuspension buffer containing 10 and 40 mM imidazole. The protein was 
eluted with 250 mM imidazole and analysed by SDS-PAGE to verify sample 
purity. Imidazole was removed using a 10-DG desalting column (Bio-Rad) equilibrated
with 20 mM Tris, 200 mM NaCl pH 8.0. Where necessary, the samples were
incubated with 10 mM FMN to improve flavin occupancy in the protein. Excess 
FMN was removed before experimentation by desalting, as described above. The 
N-terminal polyhistidine tag was retained for all experiments. 

Purification of A. niger Fdc1. A. niger Fdc1 was purified as described above using
50 mM Tris, 200 mM NaCl pH 7.5 as the resuspension buffer and 20 mM Tris,
100 mM NaCl pH 7.5 as the desalting buffer.

UV-visible spectrometry and protein quantification. UV-visible absorbance
spectra were recorded using a Cary UV-visible spectrophotometer. Protein
concentrations were estimated from A₂₈₀ using their respective absorbance
coefficients, which were calculated from their primary amino acid sequence using
the ProtParam program on the ExPASy proteomics server. P. aeruginosa UbiX
(wild-type and E49Q) ε₂₈₀ = 16,960 M⁻¹ cm⁻¹; Y169F ε₂₈₀ = 15,470 M⁻¹ cm⁻¹;
and W200F ε₂₈₀ = 11,460 M⁻¹ cm⁻¹. The concentrations of A. niger Fdc1 and
FMN were estimated using ε₂₈₀ = 68,870 M⁻¹ cm⁻¹ and ε₄₅₀ = 12,500 M⁻¹ cm⁻¹,
respectively.
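
As a minimal illustration of the quantification described above, the sketch below applies the Beer-Lambert law (A = ε c l) with the absorbance coefficients listed in the text. The 1 cm path length, the function name and the example absorbance readings are assumptions made for illustration only.

```python
# Absorbance coefficients (M^-1 cm^-1) as listed in the text.
EPSILON = {
    "UbiX wild-type/E49Q (280 nm)": 16960.0,
    "UbiX Y169F (280 nm)": 15470.0,
    "UbiX W200F (280 nm)": 11460.0,
    "Fdc1 (280 nm)": 68870.0,
    "FMN (450 nm)": 12500.0,
}


def concentration_uM(absorbance, epsilon_M_cm, path_cm=1.0):
    """Concentration in uM from the Beer-Lambert law, c = A / (epsilon * l)."""
    if epsilon_M_cm <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return absorbance / (epsilon_M_cm * path_cm) * 1e6


# Hypothetical readings, chosen only to show the arithmetic.
fmn = concentration_uM(0.5, EPSILON["FMN (450 nm)"])
ubix = concentration_uM(0.5088, EPSILON["UbiX wild-type/E49Q (280 nm)"])
print(f"FMN:  {fmn:.1f} uM")
print(f"UbiX: {ubix:.1f} uM")
```

Note that bound flavin also absorbs at 280 nm, so a calculation of this kind only approximates the protein concentration of a holo-enzyme sample.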

Analysis of DMAP binding to P. aeruginosa UbiX. Distinct perturbations in the
absorbance profile of FMN allowed the binding of DMAP to be monitored using
UV-visible spectroscopy. The effects of titrating minimal volumes of 12.5 µM
DMAP on the absorbance spectrum of 30 µM UbiX:FMN were recorded and
converted to difference spectra by subtracting titrated profiles from the
DMAP-free sample. The dissociation constant of DMAP was determined by analysing the
total deviation in absorbance between 310 and 540 nm as a function of DMAP
concentration using a tight binding equation.
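
The text does not give the tight binding equation explicitly. A common choice, assumed here, is the quadratic solution for a 1:1 complex, [EL] = ((E + L + Kd) - sqrt((E + L + Kd)² - 4 E L)) / 2, with the absorbance change taken as proportional to the fraction of protein bound. The function names and the maximal signal of 1.0 are illustrative.

```python
import math


def complex_concentration(protein_uM, ligand_uM, kd_uM):
    """Concentration of the 1:1 complex from the quadratic (tight binding) solution."""
    total = protein_uM + ligand_uM + kd_uM
    return (total - math.sqrt(total * total - 4.0 * protein_uM * ligand_uM)) / 2.0


def absorbance_change(protein_uM, ligand_uM, kd_uM, delta_a_max):
    """Signal assumed proportional to the fraction of protein carrying ligand."""
    return delta_a_max * complex_concentration(protein_uM, ligand_uM, kd_uM) / protein_uM


# 30 uM UbiX:FMN and Kd = 12 uM are taken from the text; the DMAP totals and
# the maximal signal (1.0, arbitrary units) are hypothetical.
for dmap_uM in (0.0, 10.0, 30.0, 100.0, 300.0):
    signal = absorbance_change(30.0, dmap_uM, 12.0, 1.0)
    print(f"{dmap_uM:6.1f} uM DMAP -> relative signal {signal:.3f}")
```

Unlike a simple hyperbola, this form accounts for depletion of free ligand, which matters when the protein concentration is comparable to or larger than Kd, as is the case here.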

Reduction and oxidation of P. aeruginosa UbiX (wild-type and variants) for
UV-visible spectroscopy and EPR analysis. All anaerobic work was performed
under N₂ in a glove box (Belle Technology, UK). 150 µM P. aeruginosa UbiX:FMN
was rendered anaerobic in 20 mM Tris, 200 mM NaCl pH 8.0 and mixed with
1 mM DMAP to monitor the effect of redox cycling on flavin properties. Sodium
dithionite was titrated into the sample to fully reduce the FMN, which was
subsequently re-oxidized. UV-visible spectra were recorded at each stage and 300-µl
aliquots were isolated for EPR analysis.
Preparation of cofactor extracts for mass identification. The oxidized product
of UbiX turnover was extracted from P. aeruginosa UbiX under aerobic conditions
by denaturing the protein with equal volumes of acetonitrile at 70 °C. Precipitate
was removed by centrifugation at 16,000g for 10 min. The resulting supernatant
was sent for high-resolution mass analysis.

The reduced product of UbiX turnover was generated under anaerobic
conditions from a steady-state reaction containing 1 mM FMNH₂, 1 mM DMAP and
20 µM P. aeruginosa UbiX. Protein was removed by precipitation at room
temperature as described above. The resulting supernatant was sent for analysis by
high-resolution mass spectrometry.
Stopped-flow kinetics of P. aeruginosa UbiX. The kinetics of a single turnover
reaction between UbiX:FMNH₂ and DMAP were studied under anaerobic conditions
on a SX.18MV-R stopped-flow spectrophotometer (Applied Photophysics,
dead time approximately 1 ms) by measuring changes in the flavin absorbance
spectrum logarithmically over 60 s using a photodiode array (PDA) detector.
100 µM UbiX:FMN was rendered anaerobic in 20 mM Tris, 200 mM NaCl pH
8.0 and supplemented with 1 mM glucose, 10 U ml⁻¹ glucose oxidase (Sigma) to
guarantee an oxygen-free environment. The protein was reduced with sodium
dithionite before being rapidly mixed with equal volumes of 500 µM-4 mM
DMAP at room temperature. The spectral intermediates were resolved by singular
value decomposition using the Pro-Kineticist program (Applied Photophysics).
Data could be fitted with minimum residuals to a three-step model
(A → B → C → D), which allowed the rate constants kAB, kBC and kCD to be estimated.
Species D was identified as a photoinduced adduct and considered an artefact of
the prolonged illumination required.
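
As an illustration of the sequential three-step scheme named above, the sketch below integrates A → B → C → D as consecutive first-order steps. It is not the Pro-Kineticist fitting procedure used in the study: the function names, the fourth-order Runge-Kutta integrator and every rate constant are hypothetical placeholders chosen only to show the shape of such a model.

```python
# Sequential first-order model A -> B -> C -> D (illustrative sketch only).
# The rate constants used here are hypothetical placeholders, not fitted values.

def derivatives(conc, k_ab, k_bc, k_cd):
    """Return d[A]/dt, d[B]/dt, d[C]/dt, d[D]/dt for the sequential scheme."""
    a, b, c, _d = conc
    return (
        -k_ab * a,
        k_ab * a - k_bc * b,
        k_bc * b - k_cd * c,
        k_cd * c,
    )


def rk4_step(conc, dt, k_ab, k_bc, k_cd):
    """Advance the concentrations by one classical Runge-Kutta step."""
    def shifted(base, slope, factor):
        return tuple(x + factor * s for x, s in zip(base, slope))

    s1 = derivatives(conc, k_ab, k_bc, k_cd)
    s2 = derivatives(shifted(conc, s1, dt / 2), k_ab, k_bc, k_cd)
    s3 = derivatives(shifted(conc, s2, dt / 2), k_ab, k_bc, k_cd)
    s4 = derivatives(shifted(conc, s3, dt), k_ab, k_bc, k_cd)
    return tuple(
        x + dt / 6 * (a + 2 * b + 2 * c + d)
        for x, a, b, c, d in zip(conc, s1, s2, s3, s4)
    )


def simulate(k_ab, k_bc, k_cd, t_end=60.0, dt=0.001, a0=1.0):
    """Return (A, B, C, D) at t_end, starting from pure A at fraction a0."""
    conc = (a0, 0.0, 0.0, 0.0)
    for _ in range(int(round(t_end / dt))):
        conc = rk4_step(conc, dt, k_ab, k_bc, k_cd)
    return conc


if __name__ == "__main__":
    # Hypothetical rate constants in s^-1, over the 60 s observation window.
    a, b, c, d = simulate(k_ab=50.0, k_bc=5.0, k_cd=0.5)
    print(f"A={a:.4f} B={b:.4f} C={c:.4f} D={d:.4f}")
```

In a real analysis the simulated populations would be combined with the species spectra from singular value decomposition and the rate constants adjusted to minimize the residuals against the measured absorbance data.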

The kinetic parameters of a single turnover reaction between UbiX:prFMNreduced
and oxygenated buffer were measured using a stopped-flow instrument (see above).
Here 100 µM UbiX:FMN was rendered anaerobic (in the absence of glucose
oxidase), before being reduced with dithionite and mixed with 1 mM DMAP. The
protein was rapidly mixed with equal volumes of 0-100% oxygenated buffer,
producing absorbance spectra that could be analysed by singular value decomposition.
Data were fitted to a one-step model (A → B), which allowed the rate constant kAB
to be estimated.
In vitro reconstitution of A. niger Fdc1. For reconstitution under steady-state
conditions, a solution containing 1 mM FMNH₂, 1 mM DMAP and, respectively,
wild-type P. aeruginosa UbiX, E49Q or Y169F was incubated with 50 µM A. niger
apo-Fdc1 in 20 mM Tris, 200 mM NaCl, 1 mM MnCl₂ pH 8.0 under anaerobic
conditions. The specific activity of reconstituted Fdc1 with 50 µM cinnamic acid
(in 50 mM potassium phosphate, 50 mM KCl pH 6.0; either aerobic or anaerobic)
was determined at regular time intervals by measuring the rate of decarboxylation
at 270 nm using a Cary UV-visible spectrophotometer.

Reconstitution experiments using a filtered UbiX reaction were carried out
using 2 mM FMNH₂ anaerobically incubated overnight with 5 mM DMAP and
50 µM UbiX. This reaction mixture was used to reconstitute A. niger apo-Fdc1
(supplemented with Mn²⁺) in a 2:1 molar ratio, assuming complete conversion
of FMNH₂ to prFMNreduced. To obtain filtrate devoid of UbiX, the reaction
mixture was anaerobically filtered in a 10K MWCO centrifugal concentrator
(Generon). A negative control with DMAP omitted from the overnight reaction
was also performed. Decarboxylase activity was measured using 800 µM aerobic
cinnamic acid.

Crystallization of P. aeruginosa UbiX (wild-type and variants). A total of
7.5 mg ml⁻¹ P. aeruginosa UbiX in 20 mM Tris, 200 mM NaCl pH 8.0 was
supplemented with 0.5 mM FMN and mixed with 1 mM DMAP to promote ligand
binding in the active site. Initial screening, sitting 0.3 µl protein and 0.3 µl mother
liquor next to 50 µl reservoir, yielded a variety of hits after 1-2 days at 25 °C. The
best crystals were obtained after 1-2 days in 12% PEG 3350, 150 mM sodium
thiocyanate, and 100 mM Tris pH 7.2 at 25 °C. Crystals of the UbiX variants were
obtained in the same conditions.

Diffraction data collection and structure elucidation. All crystals were
cryoprotected in mother liquor supplemented with 10% PEG 200 and flash-cooled in
liquid nitrogen. Reaction intermediates were obtained by flash cooling crystals that
had been incubated in cryoprotectant containing sodium dithionite. Where
periods of long soaking were necessary, the cryoprotectant was also supplemented
with FMN and DMAP to reduce dissociation from the crystals. Diffraction data
were collected up to 1.4 Å at Diamond beamlines at 100 K and processed using the
CCP4 suite. Data were reduced and scaled using XDS. The structure of
P. aeruginosa UbiX was refined using REFMAC5, using 3ZQU as the starting
model, and refined by cycles of manual rebuilding in COOT and additional
processing in REFMAC5 (ref. 25). Ligand coordinates and definitions were
generated using the GlycoBioChem PRODRG2 server
(http://davapc1.bioch.dundee.ac.uk/cgi-bin/prodrg). The data and refinement
statistics are available in Extended Data Table 1.
Mass spectrometry. All solvents were of HPLC-MS grade and purchased from
Sigma-Aldrich (Gillingham, UK). HPLC grade formic acid was purchased from 
Fisher Scientific (Loughborough, UK). Mass spectrometer calibration solution and 
chromatography columns were purchased from Thermo-Fisher Scientific (Hemel 
Hempstead, UK). 

All UHPLC-MS work was carried out on a Thermo-Finnigan Orbitrap-LTQ XL 
hybrid mass spectrometer operated in negative ionization mode coupled to a 
Thermo Accela autosampler (Fisher Scientific, Bremen, Germany). Chromatographic
separations were adapted from work carried out by Fu and co-workers
and performed on a Thermo Hypersil Gold 2.1 µm C18 column at a solvent flow
rate of 400 µl min⁻¹. For initial profiling tests the column was eluted with 0.1%
formic acid in water (A) and 0.1% formic acid in acetonitrile (B). The solvent 
composition during gradient elution was initiated with 5% (B) for 5 min and 
subsequently ramped to 95% (B) over 15 min, followed by a 5 min isocratic elution 
at 95% (B) before a return to 95% (A) held for a further 5 min for column
equilibration. All samples were maintained at 4 °C within the autosampler refrigerator
while the column was maintained at 50 °C within the autosampler oven. Mass
calibration was carried out in accordance with the manufacturer’s guidelines
using caffeine, the tetrapeptide MRFA and Ultramark 1621 in an
acetonitrile/methanol/acetic acid solution. Acquisition settings for initial profiling were
carried out at 60,000 resolution in centroid and ran at 1 µ-scan per 400 ms in
the 100-1,000 m/z range with source gases set at sheath gas = 40 arbitrary units,
aux gas = 5 arbitrary units, sweep gas = 5 (all system-specific arbitrary units). The
ESI source voltage was set to 4.2 V, and capillary ion transfer tube temperature set
at 275 °C.
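
The gradient programme described above can be written as a simple piecewise function of time. This is only a sketch of the stated segments (5% B for 5 min, a 15 min linear ramp to 95% B, 5 min at 95% B, then 5 min at 95% A); the function name is hypothetical, and treating the return to 95% A as an instantaneous step at 25 min is an assumption, since the text does not give the switching time.

```python
# Piecewise sketch of the profiling gradient: percentage of solvent B over time.
# Assumption: the return to 5% B (95% A) at 25 min is an instantaneous step.

def percent_b(t_min):
    """Return the programmed %B at time t_min (minutes from injection)."""
    if t_min < 0 or t_min > 30:
        raise ValueError("time is outside the 0-30 min programme")
    if t_min <= 5:
        return 5.0                                   # initial hold at 5% B
    if t_min <= 20:
        return 5.0 + 90.0 * (t_min - 5.0) / 15.0     # linear ramp to 95% B
    if t_min <= 25:
        return 95.0                                  # isocratic hold at 95% B
    return 5.0                                       # re-equilibration at 95% A


if __name__ == "__main__":
    for t in (0, 5, 12.5, 20, 22, 28):
        print(f"{t:5.1f} min: {percent_b(t):5.1f}% B, {100 - percent_b(t):5.1f}% A")
```

The programmed composition at the pump is not necessarily the composition at the column when a peak elutes, because of the system dwell volume, so values from this function should not be compared directly with the elution compositions quoted in the figure legends.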

Mass fragmentation analysis was carried out with the same column chemistry, 
source settings and flow rate but with an isocratic solvent elution of 55% (A)/45% 
(B), an optimum composition determined for eluting the analyte of interest, based 
on the initial elution profile described above. MS source and sample storage/column
conditions were kept identical. Collision-induced dissociation (CID) settings
were set up to trap target ions with an isolation width of 1.0 m/z, normalized 
collision energy of 35, activation Q of 0.250 and activation time of 30 ms. 


EPR spectroscopy. Continuous wave X-band (~9.4 GHz) EPR spectra were 
obtained using a Bruker ELEXSYS E500 EPR spectrometer operating at cryogenic 
temperatures via an Oxford Instruments ESR900 liquid helium cryostat interfaced 
with an ITC503 temperature controller from the same manufacturer. Samples 
were 250 µl in Wilmad 4 mm outer diameter quartz tubes. Sample temperatures
were as stated, microwave power was 10 µW, modulation amplitude was 1.5 G.
Pulsed electron nuclear double resonance (ENDOR) spectra were obtained at 65 K 
using a Bruker ELEXSYS E580 spectrometer equipped with an EN 4118X-MD4 
dielectric resonator. Temperature control was effected using an Oxford 
Instruments CF935 cryostat and ITC503 temperature controller. The Davies 
pulsed ENDOR sequence was employed at intermediate Q using soft microwave 
pulses and FID detection (π/2 = 200 ns) with a 9 ms radiofrequency π pulse.
DFT calculations. Density functional theory (DFT) models of FMN:DMA adducts
were optimized in the gas phase using the (U)B3LYP/6-311++G(d,p) level of
theory implemented in Gaussian 09 (ref. 28). The models consist of an FMN
truncated after the 2′ carbon and 2-methyl-2-butene, that is, the dephosphorylated
DMAP (Extended Data Fig. 4). Structural alignments to the crystal coordinates
were performed using Swiss-PdbViewer version 4.1 (ref. 29). Harmonic
vibrational frequencies calculated using normal mode analysis were used to confirm
that optimized geometries were always in local or global minima.

Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment. 
Extended Data Figure 1 | Mass spectrometric analysis of the UbiX product.
a, Structural elucidation of the reduced UbiX product. From an initial full scan
TIC on UbiX extract, a 525 m/z ion extracted chromatogram was created under
a gradient elution using H₂O/acetonitrile both containing 0.1% formic acid
indicating a major peak apex at 9.57 min with a 54/46 solvent elution
composition (not shown). Subsequent data-dependent TIC and 525 m/z scan
extracted chromatograms were created under 55% A/45% B isocratic solvent
elution and ion extraction between 524.5-525.5 m/z produced a singular peak
at 2.28 min displaying an associated full-scan molecular ion peak with m/z =
525.1726 (M⁻ = C₂₂H₃₀N₄O₉P) at a resolution of 58,500 with a mass accuracy
of 3.59 p.p.m. Fragmentation of the 525.1726 m/z molecular ion peak in an
automated data-dependent manner using helium-based collision-induced
dissociation (CID level 35) generated a spectral tree that indicates the removal
of the newly formed, more labile, tertiary ring at the MS² level. Subsequent
removal of the phosphate head group at the MS³ level was achieved using CID
35 on the 456.23 m/z molecular species with a final MS⁴ step using CID 35 on
358.18 m/z completely removing the tail group from the central three-ring
isoalloxazine system. b, Structural elucidation of the re-oxidized UbiX/Fdc1
cofactor. From an initial full scan TIC on UbiX extract (i), a 524 m/z ion
extracted chromatogram was created under a gradient elution using
H₂O/acetonitrile both containing 0.1% formic acid indicating a major peak apex at
9.24 min with a 48/52 solvent composition (not shown). Subsequent
data-dependent TIC and 524 m/z scan extracted chromatograms (ii) were created
under 50% A/50% B isocratic solvent elution and ion extraction between
523.5-524.5 m/z produced a singular peak at 2.08 min displaying an associated
full-scan molecular ion peak with m/z = 524.1656 (M⁻ = C₂₂H₂₉N₄O₉P) at a
resolution of 58,500 with a mass accuracy of 2.78 p.p.m. Fragmentation of the
524.1656 m/z molecular ion peak in an automated data-dependent manner
using helium-based collision-induced dissociation (CID level 35) generated a
spectral tree (iii) that indicates the removal of the newly formed, more labile,
tertiary ring at the MS² level. Subsequent removal of the phosphate head group
at the MS³ level was achieved using CID 35 on the 498.31 m/z molecular species
to create 327.18 (A) alongside a sister fragment 455.31 (B) that represents the
full removal of the tertiary ring but retaining the phosphate head group.
Extended Data Figure 2 | EPR spectroscopic analysis of the UbiX radical
product. a, X-band continuous wave EPR spectra of UbiX in frozen solution:
(i) wild type (WT) as isolated; (ii) WT plus DMAP; (iii) WT reduced with
dithionite; (iv) WT + DMAP reduced with dithionite; (v) WT + DMAP
reduced with dithionite and reoxidized with oxygen; (vi) Y169F mutant +
DMAP reduced with dithionite and reoxidized with oxygen; (vii) W200F
mutant + DMAP reduced with dithionite and reoxidized with oxygen. The
FMN-DMAP adduct radical is only formed when UbiX is reoxidized in
the presence of DMAP and this formation is not affected by mutation of those
aromatic residues forming the π-cage that could give rise to Y or W radical
species. b, X-band continuous wave EPR spectra of frozen solutions of WT
UbiX + DMAP reduced with dithionite, with the addition of potassium
ferricyanide to the following concentrations: (i) 260 µM; (ii) 160 µM; (iii)
50 µM; (iv) 40 µM; (v) 30 µM; (vi) 20 µM; (vii) 0 µM. Experimental conditions:
microwave power 10 µW, modulation amplitude 1.5 G, temperature 20 K.
The spectra show that the radical can also be formed using chemical oxidation in the absence
of oxygen and thus does not arise from a peroxide species generated by the
reaction of reduced oxygen species formed when the dithionite sample is
exposed to oxygen. An initial radical is formed under these conditions that
exhibits a considerably broader EPR signal than the prFMNradical and is as
yet unidentified.
Extended Data Figure 3 | Pulsed Davies ENDOR spectra of the
prFMNradical–UbiX complex. The spectrum was measured at a field equivalent
to gav = 2.0033. Although a complete assignment of the spectrum requires
specific deuteration of FMN and DMAP, the ENDOR spectrum is dominated
by two large hyperfine couplings to β-protons indicated as HA and HB. Using
the Heller-McConnell equation the values of the dihedral angles, θ, can be
determined as shown and are consistent with the orientation of the C1′-protons
of the DMAP-derived fragment of the radical observed crystallographically,
as shown in the figure above. The unpaired electron spin density, ρ, at N5 of the
FMN-derived fragment of the radical can also be estimated from the
Heller-McConnell equation. B′ is negligible, whereas B″ is thought to have a value
of ~160, although studies of β-protons coupled to unpaired electron spin
density at a nitrogen atom are rare, giving an unpaired spin density at N5 of
~0.3, consistent with calculations and considerably smaller than the unpaired
electron spin density of 0.4 or greater expected for C1′ of an aromatic amino
acid radical.
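
For orientation, the Heller-McConnell relation referred to in this legend is commonly written in the form below. This is the standard textbook expression rather than a transcription of the authors' figure; the symbol a_β for the isotropic β-proton hyperfine coupling is a notational assumption, whereas ρ, θ, B′ and B″ follow the legend.

```latex
a_{\beta} = \rho \left( B' + B'' \cos^{2}\theta \right)
\qquad\Longrightarrow\qquad
a_{\beta} \approx \rho \, B'' \cos^{2}\theta
\quad \text{when } B' \approx 0
```

With the values quoted in the legend (ρ ≈ 0.3, B″ ≈ 160), the largest coupling this relation allows, at θ = 0, is about 0.3 × 160 ≈ 48 in the units of B″; smaller observed couplings then correspond to larger dihedral angles.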
Extended Data Figure 4 | DFT modelling of the UbiX product. Top, DFT
model of the purple radical species showing the location of significant atomic
spin densities (>|0.02|) to the right. The optimized structure (blue carbons)
overlaid with the crystal coordinates (green carbons) is shown below. The
model was geometry optimized in the gas phase using the UB3LYP/6-311++G(d,p)
level of theory. Cartesian coordinates of the optimized structure
are given in the Supplementary Information.
Extended Data Figure 5 | Additional UbiX solution studies.
a, Reconstitution of A. niger Fdc1 activity with UbiX–prFMNreduced and
prFMNreduced obtained through filtration of a UbiX–prFMNreduced reaction.
Control reactions are devoid of any DMAP substrate. b, Rate of formation of
spectral species 2 (see Fig. 1f) as a function of DMAP concentration. c, Rate of
decay of spectral species 2 (see Fig. 1f) as a function of DMAP concentration.
d, Spectral species obtained from singular value decomposition of rapid-scan
stopped-flow spectrophotometric data following mixing of UbiX–prFMNreduced
with oxygenated buffer. e, The rate of purple radical (species B in panel
d of this figure) formation as obtained from singular value decomposition
of rapid-scan stopped-flow spectrophotometric data following mixing of
UbiX–prFMNreduced with oxygenated buffer has a linear dependence
on oxygen concentration. Error bars are s.e.m., n = 3.
Extended Data Figure 6 | Multiple sequence alignment of UbiX/Pad1
enzymes from selected bacterial or fungal species. Pseudomonas aeruginosa
UbiX (NP_252708), Escherichia coli O157:H7 EcdB (NP_311620),
Escherichia coli UbiX (YP_490553), Bacillus subtilis BsdB (WP_009966530),
Saccharomyces cerevisiae Pad1 (AAB64980), Aspergillus niger PadA1
(ABN13117), and orf8 from the Thauera aromatica phenylphosphate
carboxylase gene cluster (PAAD_THAAR). Conserved residues involved in
phosphate binding, N5 polar network or formation of the substrate
binding π-cage are indicated by labelled arrows. Secondary structure elements
of P. aeruginosa UbiX crystal structure are shown. α-helices and 3₁₀
helices (denoted as η) are shown as squiggles, β-strands as arrows and β-turns
as TT.
Extended Data Figure 7 | Crystal structure of P. aeruginosa UbiX–FMN–DMAP
flash cooled to 100 K at 30 s following complete reduction by sodium
dithionite. Two orientations are displayed as in Fig. 2. The omit map for
the prFMNreduced product is shown as a green mesh, contoured at 4σ.


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 8 | Crystal structures of P. aeruginosa UbiX Y169F.
a, Detailed view of the UbiX Y169F–FMN–DMAP complex with individual amino
acids contributing to active site structure shown in atom-coloured sticks
(carbons colour coded as in Fig. 2a). Two orientations are displayed as in Fig. 2.
The omit map for the DMAP substrate is shown as green mesh, contoured
at 4σ. b, Detailed view of the UbiX Y169F N5–C1′ adduct species obtained
through flash-cooling following reduction. The omit map for the N5–C1′
adduct is shown as a green mesh, contoured at 4σ.
Extended Data Figure 9 | DFT models of proposed intermediate species in
the UbiX reaction. a, DFT models of species II and IVa (as defined in Fig. 4).
Conversion from II to IVa is achieved by ~180° rotation about C1′–C2′ (blue
arrow) and the N5-H and methanol species (red) are only found in species
IVa models. b, Overlay of the species II DFT model (green carbons) with the
crystal coordinates of species II and S15 (teal carbons). c, Three DFT models of
IVa were examined and two orthogonal projections are shown overlaid with
the crystal coordinates (teal carbons): (Vi, yellow carbons) with a methanol
analogue of S15 (a, in red) with the C-N5 distance fixed to the crystallographic
distance of 4.0 Å; (Vii, magenta carbons) with N5 protonated (no methanol),
and (Viii, light pink carbons) with N5 deprotonated and no methanol. DFT
models of species V and VI are shown in d and e, respectively, and are overlaid in
f (V green carbons, VI magenta carbons). g, Overlay of the species VI DFT
model (magenta carbons) with the crystal coordinates (teal carbons). Models
were geometry optimized in the gas phase using the B3LYP/6-311++G(d,p)
level of theory. Harmonic vibrational frequencies calculated using normal
mode analysis were used to confirm that optimized geometries of all species
were in local or global minima. In the case of species Vi, ‘ModRedundant’
optimization was performed to fix the C-N5 distance and one imaginary
frequency of 67.60 cm⁻¹ was observed. Cartesian coordinates of the optimized
structures are given in the Supplementary Information.
Extended Data Table 1 | Data collection and refinement statistics

Columns, left to right (UbiX variant, state, PDB code):
1. WT, FMN:DMAP (4ZAF)
2. WT, N5 adduct, 5 s following reduction (4ZAV)
3. WT, N5/C6 adduct, 30 s following reduction (4ZAW)
4. WT, N5/C6 adduct, co-crystallised radical (4ZAX)
5. E49Q, FMN:DMAP (4ZAG)
6. E49Q, FMNH₂:DMAP, <5 s following reduction (4ZAL)
7. E49Q, N5 adduct, >15 s following reduction (4ZAY)
8. Y169F, FMN:DMAP (4ZAN)
9. Y169F, N5 adduct, >15 s following reduction (4ZAZ)

Values in parentheses are as given in the source table.

Data collection
Space group  F23  F23  F23  F23  F23  F23  F23  F23  F23
Cell dimensions
a, b, c (Å)  141.9  142.18  142.74  141.79  142.31  142.07  142.02  141.73  142.26
α, β, γ (°)  90  90  90  90  90  90  90  90  90
Resolution (Å)  70.95-1.71  50.27-1.40  43.04-1.89  42.75-1.61  32.65-1.68  42.84-1.62  32.58-1.54  70.86-1.76  50.30-1.45
  (1.75-1.71)  (1.44-1.40)  (1.94-1.89)  (1.65-1.61)  (1.72-1.68)  (1.66-1.62)  (1.58-1.54)  (1.81-1.76)  (1.49-1.45)
Rpim  2.7 (29.8)  2.7 (29.7)  4.3 (33.0)  2.4 (33.5)  3.5 (35.6)  2.4 (29.2)  2.1 (26.6)  2.6 (36.0)  2.8 (35.6)
I/σI  18.7 (2.8)  16 (2.7)  13.4 (2.7)  20.1 (2.5)  13.5 (2.2)  16.9 (2.8)  20.3 (3.0)  19.3 (2.4)  15.7 (2.4)
Completeness (%)  100 (100)  99.6 (100)  99.9 (99.9)  99.9 (100)  99.9 (99.9)  99.8 (100)  100 (100)  99.9 (100)  99.9 (99.9)
Redundancy  6.7 (6.8)  6.8 (6.8)  6.7 (6.7)  5.6 (5.4)  5.6 (5.5)  6.6 (6.7)  6.7 (6.7)  6.7 (6.4)  5.5 (5.4)

Refinement
Resolution (Å)  70.95-1.71  50.27-1.40  43.04-1.89  42.75-1.61  32.65-1.68  42.84-1.62  32.58-1.54  70.86-1.76  50.30-1.45
  (1.75-1.71)  (1.44-1.40)  (1.94-1.89)  (1.65-1.61)  (1.72-1.68)  (1.66-1.62)  (1.58-1.54)  (1.81-1.76)  (1.49-1.45)
No. reflections  24314  44084  18350  29027  25816  28593  33304  22355  40068
  (1295)  (2399)  (990)  (1541)  (1402)  (1525)  (1756)  (1074)  (2066)
Rwork/Rfree  13.74/17.23  9.77/12.90  14.76/19.16  14.36/17.10  15.15/17.97  13.97/16.34  14.54/17.60  14.24/18.20  10.26/14.72
No. atoms
  Protein  1621  1627  1554  1620  1641  1664  1727  1593  1658
  Ligand  41  36  36  36  41  41  36  41  36
  Ion  13  18  13  14  4  6  15  7  20
  Water  139  194  100  127  141  139  144  116  156
B-factors
  Protein  23.22  17.562  24.13  22.685  23.492  26.251  22.928  26.577  20.892
  Ligand  20.21  14.938  19.286  26.107  21.486  23.631  19.212  21.411  17.141
  Ion  38.34  29.653  30.154  25.756  25.435  60.387  28.885  34.894  28.034
  Water  33.289  34.115  29.071  31.840  32.838  36.643  32.833  37.493  32.705
R.m.s. deviations
  Bond lengths (Å)  0.0253  0.0258  0.0209  0.0260  0.0229  0.0262  0.0262  0.0222  0.0250
  Bond angles (°)  2.1189  1.9826  1.9719  2.4312  2.1375  2.3693  2.3773  2.4068  2.2280
TRADE TALK Organizing career panels helps
a scientist to land a job p.509


JOB SEARCH How to find a career that fits 
go.nature.com/ku87pw 


NATUREJOBS For the latest career 
listings and advice www.naturejobs.com 


NURSING RESEARCH 


Nurses know best 


The time spent at a patient’s bedside makes nurses the 
perfect people to pursue potent quality-of-life research. 


BY KENDALL POWELL 


As an oncology nurse, John Merriman noticed a
striking variation in his patients’ ability 
to concentrate after undergoing radiation 


treatment. Many people brought stacks of 
books for their hospital stays, but only some 
could make it through all of their reading. 
Several people struggled to focus even on short 
magazine articles. 

The more he thought about these cognitive 
differences, the more he realized that, if he
wanted to find the cause — or develop possible
interventions — he would need to do some
doctoral-level research. When a research mentor
in his nursing programme at the University of 
California, San Francisco (UCSF), mentioned 
that she had a data set from cancer patients that 
included self-reported attention scores and 
corresponding blood samples, Merriman knew 
what the topic of his PhD dissertation would be.

“This would give me a way to look at this 
variability I had seen clinically using the tools 
of research,” says Merriman, who is now a
postdoctoral cancer researcher at the University of
Pittsburgh in Pennsylvania. “I was hooked at
that point.”

As a registered nurse who also holds a 
PhD, Merriman is one of a small but growing
number of nurse scientists worldwide. Many, 
like him, come to research because they see a 
problem in the clinic that needs a research- 
based solution. Nurse scientists arrive at a 
research-career path through a variety of 
trajectories; historically, many have begun 
their doctoral work mid- to late-career, after 
years of seeing patients. But that trend is
shifting as leaders in the field encourage younger
nurses to enter PhD programmes soon after 
earning a nursing degree and registered nurse 
certification. 

“We’re the new kid on the block in academia,”
says Quinn Grundy, a PhD candidate in social
and behavioural sciences at UCSF. She entered
her programme directly after completing a 
four-year nursing degree at the University 
of Alberta in Canada, and says that the PhD 
will help to provide her with flexible career 
choices because nursing research is a highly 
employable field. The US Bureau of Labor 
Statistics predicts that by 2022, more than half a
million jobs for registered nurses will be 
added to the US workforce. Nurse scientists, in 
particular, are sought by several sectors, 
including academia, health-care organizations 
and the biomedical industry. 

Although nurses have been engaged in 
research for generations, the field in the United 
States only began to recognize the need for more 
formal-training programmes around 1986. In 
the same year, the National Institute of Nursing 
Research (NINR) was created in Bethesda, 
Maryland, as part of the US National Institutes 
of Health. Other nations have followed suit: in 
Canada, for instance, the first PhDs in nursing 
were awarded in the early 1990s. Employment 
opportunities for nurse researchers who hold 
PhDs abound, with posts available in the
military, academia, hospital-based research
organizations and health-care-policy think 
tanks. Many nurse scientists work at university 
nursing or medical schools, or at large
clinical-research centres that are affiliated with them.

Outside these sectors, some nurse scientists 
work for pharmaceutical and medical-device 
companies to help to run clinical trials. 
Others head up scientific programmes at 
government organizations, including drug 
regulators and infectious-disease agencies 
or national health administrations. Health- 
management and insurance companies, as 
well as non-profit advocacy groups such as the 
American Heart Association, also seek nurse sci- 
entists to oversee their research programmes. In 
academia, nurse scientists are in high demand as 
lecturers, thanks to globally low numbers of 
nursing faculty members who are needed to 
train the next generation. 

Nurse scientists pursue research that spans 
molecular biology, physiology, medical 
imaging, and public health and policy. Nurses 
who want to engage in research will need to 
identify a relevant PhD programme and
determine how best to balance nursing practice with
their studies. Postdoctoral-research experience 
is not required for many of the nurse-scientist 
careers listed above, but it is necessary for nurse 
scientists who want to run their own research 
groups at research-intensive universities. 

Some nurse scientists find it difficult to 
make time to continue seeing patients in the 
clinic, whereas others manage to make it work 
synergistically. Rapid translation and
adoption of nursing-research findings can quickly
change bedside practice and bring nurse
scientists immense gratification. With various
career options from which to choose, the 
common thread that bonds nurse scientists 
is a focus on studies that improve patient care 
and, ultimately, quality of life. 


TESTS TO TREATMENTS 

After they receive a PhD, some nurse 
scientists give up seeing patients to run full- 
time research programmes. But in many 
cases, their research studies or administrative 
duties keep them in close contact with clinical 
practice. As a senior lecturer at Queensland 
University of Technology in Brisbane, 
Australia, Kimberly Alexander oversees the 
nursing school’s cancer-nursing curriculum, 
while also investigating how genetics influence 
the symptoms and outcomes of cancer
treatments. Similarly, Anna Axelin, a neonatal-care
nurse at the University of Turku in Finland, no 
longer works at the bedside, yet her research on 
providing intensive care for premature babies 
keeps her close to patients and their families, 
and on the hospital ward weekly. 

Patricia Grady, a neuroscientist and director
of the NINR (see ‘The price of quality of life’),
says that nurse scientists who wish to remain
in the clinic must find ways to efficiently
blend a research question with a well-matched
patient population. US Navy commander Jason
McGuire, a certified nurse anaesthetist who is
stationed at the Walter Reed National Military
Medical Center in Bethesda, Maryland, has
managed to do just that. He chose to research a
condition that affects veterans who are
undergoing general anaesthesia for surgery.

In McGuire's work with combat veterans, he
noticed a clinical phenomenon that was
missing from the scientific literature: his patients
experienced ‘emergence delirium’ at a much
higher rate than did the general population.
With this condition, veterans wake up from
anaesthesia in a confused and combative state.
It can lead to physical and cognitive
complications, both during recovery and later in life.
“I wanted to find out what the incidence was
in our military population and how we can
decrease the chance of it happening,” says
McGuire, chief of nursing science and
clinical inquiry at the Walter Reed centre.

In 2008, the Navy considered McGuire's
question important enough to send him
to get a PhD at the University of San
Diego, California. Gaps in scientific
knowledge of nursing practices are common, and
often become dissertation topics for nurse
scientists. Like many nurses, McGuire chose
to pursue the applied side of his research
question. “Am I really going to figure out why this
delirium occurs? Or can I move a little quicker,
to something that will decrease the incidence?”
he asked.

Now, four years after receiving his PhD, he
is beginning a randomized controlled trial
to test whether giving a specific sedation
medication during surgery, before patients
enter recovery, can reduce the risk of
emergence delirium. By the end of the 3-year,
370-patient study, McGuire might have
devised a new standard of surgical care for
combat veterans.


FUNDS FOR NURSES

The price of quality of life

One of the least known of the 27 US
National Institutes of Health (NIH) centres,
the National Institute of Nursing Research
(NINR) operates on an annual budget of
about US$144 million. That budget has
nearly tripled in the past 20 years, but it is
a tiny slice compared with the almost $5
billion that the NIH gives to the National
Cancer Institute, for example. About 10%
of NINR funds support a small internal
programme of seven research groups, and
the rest of the budget is given in the form
of grants to fund external research and
training programmes.

Unlike many other institutes,
however, the NINR is not focused on
a particular disease area, life stage or
organ system. “Our science is primarily
clinical research, but we are what can
be described as disease-agnostic,” says
Patricia Grady, director of the NINR in
Bethesda, Maryland. “It’s all about the
science of what causes symptoms, how
can you intervene and how can you help
people live with chronic symptoms.”

NINR-funded research has uncovered
differences in how men and women
respond to pain medicine; developed
better tools for end-of-life and palliative
care; and led to programmes that help
adolescents to better manage their
diabetes.

Grady says that both the evolution of
health-care delivery and the ageing of
populations worldwide ensure nursing
researchers a bright future. K.P.


Such immediate application of research 
findings to clinical practice is particular to
nursing science and contrasts sharply with related
fields, such as medicine, dentistry or veterinary 
science, in which translation of results into 
clinical interventions and treatments can take 
decades. This rapid application makes nursing 
research especially attractive to researchers who 
want to see their work improve lives right away. 

Quick adoption comes largely through 
the close ties that many nurse scientists 
keep with their colleagues who work at 
the bedside. Of the nearly three million 
registered nurses in the United States, 
Grady estimates that only 1% are nurse 
scientists. In 2013, the US Health Resources 
and Services Administration reported that 
almost two-thirds of nursing schools had 
restricted student enrolments because of 
faculty shortages (see go.nature.com/fwcli1). 
“We need more of a critical mass,” she says.

It is clear that increasing the number of nurse 
researchers would help to improve patient care. 
In the decade that Axelin spent in neonatal 
intensive-care units (NICUs), she noticed a 
poignant, recurring pattern: during painful 
procedures such as an intravenous insertion, 
mothers would weep outside in the hallway 
while their babies cried inside the hospital 
room. “It’s common practice to ask parents to 
step outside. But they're both crying,” she says. 
“I thought, ‘There must be some solution.’” 

Her dissertation research found that 
parents who held their premature babies 
during medical procedures alleviated the 
children’s pain as effectively as did the 
standard long-accepted protocol — giving 
babies a sugar solution before a procedure. 
Within a year of that discovery, almost 
every NICU in Finland was using Axelin’s 
parental-hold technique, and she had 
received calls from hospitals in Sweden, 
Switzerland, Norway, Estonia and Canada that 
all wanted to implement it. 

“It’s important for nurses to know and 
notice that they can make a change in care,” 
says Axelin. “In nursing research, it’s all about 
a holistic view of the patients we work with 
every day.” 

Alexander echoes that sentiment, 
emphasizing that nursing science is not 
about discovering new therapies or drugs to 
treat a particular disease. Rather, it focuses 
on understanding people’s responses to 
treatments and alleviating symptoms to 
improve quality of life. Her research aims 
to find ways to tailor nursing care to each 
patient who has cancer. 

Grady says that the NINR’s vision for 
nursing science is to generate a knowledge 
base that will improve clinical patient care 
and help to shape US health-care policy. 
Although many nursing interventions 
might seem obvious in hindsight — such 
as parents holding their babies in the NICU 
— these care procedures must still be tested 
under rigorous conditions to prove that 
they lead to better patient outcomes than do 
current standards of care. 

In some ways, nursing science is playing 
catch-up as researchers race to fill in holes 
left in the scientific literature — for instance, 
how transport by helicopter might alter a 
trauma patient’s physiology. The field is also 
poised to determine how nursing care might 
be delivered to patients, now and in the 
future, through smartphones and other 
forms of digital technology. 

Anna Axelin at the University of Turku, Finland. 


IN THE THICK OF IT 
The US military has been a front-runner in 
training nurse scientists, with the US Army, 
Navy and Air Force each sending one or two 
nurses for doctoral work each year for the past 
decade. Although military nurse scientists 
run research programmes that are designed 
to address military medicine, situations and 
populations, their work spans a wide swathe of 
disciplines, from physiology to public health. 
Navy Commander Virginia Blackman, a critical-care 
nurse scientist at the Walter Reed centre, studies 
how different pain-management protocols used on 
trauma patients in the field might 
affect their long-term development of chronic 
pain or post-traumatic stress disorder. Like 
many nurse scientists in hospitals, Blackman 
advises staff nurses on how best to convert the 
latest research findings into practice. 

As a public-health nurse scientist in the 
US Air Force, Lieutenant Colonel Jennifer 
Hatzfeld has worked on projects as diverse 
as boosting mammogram rates in military 
wives and — while deployed to Kandahar, 
Afghanistan — designing protocols for treat- 
ing blast-fragment wounds. Now based at Fort 
Detrick, Maryland, she manages the Depart- 
ment of Defense’s ‘en route’ care research 
portfolio, which focuses on improving care 
for patients during air or ground transport. 

“Some days, it’s a little bit like managing 
somebody’s cheque book,” Hatzfeld says of 
her job as a programme manager: she decides 
which projects to fund. “But I love it because 
it’s an opportunity to strategically guide the 
future of en route care.” 

She says that nursing usually attracts those 
who want to take care of others and contrib- 
ute to making the world a better place — not 
necessarily those who see themselves pur- 
suing a doctoral degree. But nurses have 
important roles in research, she says. They 
bring unique perspectives to research teams 
because they are usually closest to the final 
stages of a study, when an intervention is 
delivered to a patient or when carers are 
helping someone to cope with side effects. 
“There are questions that we need asked 
and answered that are specific to nursing,” 
Hatzfeld says. “If we don't do it, then we are 
allowing physicians or other disciplines to 
ask those questions with what might not be 
the right perspective or approach.” 


Kendall Powell is a freelance writer based in 
Lafayette, Colorado. 
CAREERS 


TRADE TALK 
Career doctor 


After finishing a PhD and a postdoctoral 
fellowship in virology, Thomas Magaldi forged 
a career helping science graduate students and 
postdocs to plan their paths. He is now the 
career-services administrator at the Memorial 
Sloan Kettering Cancer Center in New York City. 


Why did you go to graduate school? 

I wanted to be a professor at a small liberal- 
arts college. I learned that I needed a PhD 
to do that. 


How did that work out? 

During my PhD programme, I started to 
question my goals. I helped to develop 
a career-networking group for science 
trainees at Yale University in New Haven, 
Connecticut, and I explored other career 
options, including a science-policy fellow- 
ship. But every fellow I spoke to had done 
something interesting outside the labora- 
tory; I knew that I wouldn't be competitive. 


So what did you do? 

While finishing my thesis, I decided that 
I should also pursue skills that would set 
me apart. I did an internship for the US 
Department of State on how the United 
States could help Mongolia to create a via- 
ble science-education programme. I spent 
my nights doing that during the last few 
months of my PhD. Then, for my postdoc, I 
chose a mentor who would keep me excited 
about science and who was in Washington 
DC, where I would have access to policy- 
makers. I volunteered to meet with my local 
congressman about raising the US science 
budget, and I taught as an adjunct professor. 


How did all of that experience help you? 

When my wife and I had a baby, we needed 
more than my postdoc salary and we wanted 
to be near family. I realized that I had already 
built a competitive CV for many positions 
outside the lab, but still requiring a science 
PhD and postdoc training. The specific 
scientific work I had done would not distin- 
guish me, but my other experiences would. 
For the jobs that I wanted, I was ready. 


INTERVIEW BY MONYA BAKER 
This interview has been edited for length and 
clarity; see go.nature.com/5xbdz6 for more. 
Addiction can devastate the lives of people and their 
families. Researchers are disentangling its myriad 
causes and developing new treatments. 

Addiction tends to run in families, but scientists are finding 
that there is no simple ‘addictive personality’. Instead, factors 
that include genes, character traits and early life experiences 
combine to make the inheritance of addiction a complicated 
problem (see page S48). 

As an addiction develops it changes the brain: neural 
circuits related to pleasure and reward are hijacked and 
rewired (S46). Researchers are studying people from birth to 
try to tease out how these changes affect, and are affected by, 
brain development — and how they might be reversed (S50). 

Treatments for addiction are becoming more sophisticated, 
but still face major challenges to acceptance. Medication 
can help to wean people from their addiction (S53). One 
controversial but effective technique is to reward people 
for staying clean (S57). Just as important as repairing the 
addicted brain is fixing the social environment in which 
people susceptible to addiction live (S56). Another approach 
is to make the drugs themselves harder to misuse (S60). 

It is not only substances that can be addictive. Gambling is, 
so far, the only behaviour that has been recognized as 
an addiction, but researchers are considering adding 
Internet use, sex and shopping to that list (S62). But despite 
all the progress, the questions still to be answered are 
daunting (S63). 

We are pleased to acknowledge the financial support of 
the US National Institute on Alcohol Abuse and Alcoholism, 
National Institute on Drug Abuse, and Reckitt Benckiser 
Pharmaceuticals Inc., a subsidiary of Indivior PLC, in 
producing this Outlook. As always, Nature retains sole 
responsibility for all editorial content. 


Brian Owens 
Contributing editor 
THE HIJACKED BRAIN 


Addiction is a devastating disease that alters the brain’s circuitry, notably in young adults. But 
the changes need not be permanent: improved understanding of them will help in developing 
ways to lessen the burden. By Margaret Munro. See a Nature Video at go.nature.com/elggkk. 


ADDICTION CYCLE 

Addiction is characterized by bingeing, withdrawal, cravings, 
dysfunctional emotions and an inability to abstain from 
the substance or behaviour. It quickly leads to disruption 
of interconnected brain circuits that are involved 
in reward, learning and control. 

STAGE 1: BINGE/INTOXICATION 
Overstimulation of the reward circuit leads to loss of control 
and bingeing. 

STAGE 2: WITHDRAWAL/NEGATIVE MOOD 
With time, other brain areas are recruited, including those 
involved in stress and anxiety; in effect, an ‘anti-reward’ system. 
Chronic exposure to addictive substances and behaviours reduces 
the number of dopamine receptors in the nucleus accumbens (B), so 
more of the addictive substance or behaviour is needed to feel 
normal. Changes to the amygdala (E) circuitry have been tied to the 
irritability, anxiety and stress associated with withdrawal. 

STAGE 3: PREOCCUPATION/ANTICIPATION 
People with dependencies have compulsive cravings to 
repeat the addictive behaviour. Drug misuse is thought to 
alter the function of the frontal cortex (F) and the 
hippocampus (G) and helps to embed desires even if they 
have negative consequences. 

A. Basal ganglia 
Interconnected regions that are involved in learning, reward and 
habit formation. 

B. Nucleus accumbens 
This region, which receives dopamine from the ventral tegmental 
area, helps to control desire, satiety and inhibition. This area is 
involved in stages 1 and 2 of the addiction cycle. 

C. Thalamus 
Acts as a hub to relay sensory information and is also important 
in regulating arousal. 

D. Ventral tegmental area 
A primitive structure at the top of the brain stem in which 
dopamine is synthesized (see ‘The dopamine connection’). 

E. Amygdala 
Associated with memory and with emotions, notably anxiety and fear. 

F. Frontal cortex 
Responsible for thoughts and actions. The orbitofrontal cortex is 
thought to play a part in controlling behaviour. 

G. Hippocampus 
Important for consolidation of memory. 
… people had problematic 
drug use in 2012. 
… drug-related deaths 
were reported in 2012. 

… or more people smoke, with the majority 
living in low- to middle-income countries. 
… smokers die every year; more than 5 million 
deaths are directly related to tobacco use. 

… of the global population drinks alcohol, with 
an annual average of 17 litres per person. 
… of the deaths in 2012 were attributed 
to alcohol consumption. 

THE DOPAMINE CONNECTION 


Many drugs imitate natural neurotransmitters to result, directly or indirectly, in an 
increase in dopamine. Dopamine is involved in regulating feelings such as pleasure 
and in activities that include movement and learning. The most well-described 
interactions are shown here, but there are other putative routes. 
dopamine release. 


COCAINE 


Blocks the removal of 
dopamine from the 
synapse, so the brain 
receives constant 
messages of euphoria. 
Synapse (the gap between neurons) 


DANGEROUS AGE 


Many people have their first experience of drugs at a 
young age, placing them at high risk of addiction. The 
developing brain may not form properly under the 
influence of drugs or alcohol. 


HIGH COST OF A HABIT 


The estimated annual cost of 
health care associated with 
substance misuse in the 
United States. 
Genetics can influence addictive behaviour later in life, but linking genes to addiction is complicated. 


No more addictive 
personality 


The roles of temperament, metabolism and development 
make the inheritance of addiction a complex affair. 


BY MAIA SZALAVITZ 


“One drunkard begets another,” wrote 
the Greek philosopher Plutarch 
nearly 2,000 years ago, demonstrating 
the age-old wisdom of the observation that 
alcoholism runs in families. 

But determining exactly what it is that 
addicted parents pass down to their children 
has proved difficult. Scientists have searched 
for decades for an ‘addictive personality’ that 
leaves someone vulnerable to drug prob- 
lems, but without success. Researchers have 
tried to identify the genes responsible for 
addiction, and they have examined the role of 
early exposure to trauma. Yet they have failed 
to isolate a single genetic factor that reliably 
distinguishes between the 10-20% of people 
who try alcohol or illegal drugs and get hooked 
and the majority who do not. 

Now, however, research into genetics and 
epigenetics is finally starting to shed some 
light on the causes of addiction — and it turns 
out that the idea of an addictive personality 
is a myth. Instead, an enormous number of 
factors, ranging from early life trauma to genes 
that code for metabolic enzymes, have a role 
in how the genetics of addiction unfold. By 
understanding how these factors fit together, 
researchers hope to develop strategies for the 
prevention and treatment of addiction. 

Plutarch was right to say that addiction is 
often a familial trait — and it seems that much 
of this risk is carried genetically. Joni Rutter, 
director of the Division of Basic Neuroscience 
and Behavioral Research at the US National 
Institute on Drug Abuse in Bethesda, Mary- 
land, says that regardless of the drug involved, 
about 50% of the risk is genetic, within a range 
of about 40-60%. 

Alcoholism is the most widely researched 
addiction because alcohol 
use has such a long history in many cultures. 
According to George Koob, director of the US 
National Institute on Alcohol Abuse and Alco- 
holism in Bethesda, Maryland, the children of 
people who are dependent on alcohol are 3-5 
times more likely to develop the disorder than 
the rest of the population — and this risk is 
roughly the same regardless of whether they are 
raised by their alcohol-dependent parents or 
adopted by parents who are not dependent on 
alcohol. The condition is about 60% heritable, 
he says, adding that this is “reasonably high”. 

Researchers may have managed to dem- 
onstrate that genetic predispositions exist, 
but linking particular genes or traits with 
addictions has proved much more difficult. 
Initial genetic findings are often announced 
with great fanfare, only to fail in replication 
or be found to have extremely small effects. 
“Addiction is very heterogeneous,” says Rutter. 
“There are many ways to get there.” 


DRUGS AND DISORDERS 

Some temperaments and disorders do raise the 
risk of addiction, however. About half of peo- 
ple with drug-use disorders have an additional 
psychiatric diagnosis, often a mood, anxiety or 
personality disorder. “What we’re finding is that 
the addictive personality, if you will, is multifaceted,” 
says Koob. “It doesn’t really exist as an 
entity of its own.” Some people with addictions 
have many of the associated personality traits, others have none, 
but only a few have all of them. 

The personality disorder most commonly 
associated with addiction is antisocial 
personality disorder (ASPD), which involves 
dishonest, manipulative, insensitive and 
criminal behaviour. These characteristics make 
up the stereotype of someone with an addiction. 

“Antisocial behaviour and alcoholism and 
drug abuse share a bunch of genetic risk factors,” 
says Kenneth Kendler, professor of psychiatry 
and human genetics at Virginia Commonwealth 
University in Richmond, who has studied these 
links in twins. “That’s replicated pretty robustly.” 

A large epidemiological study found that 
18% of people with illegal-drug-use disorders 
have ASPD, as do 9% of people with alcohol-use 
disorders, much higher than the 4% 
found in the general population. But although 
having high levels of antisocial traits is one of 
the best predictors of substance-use disorders, 
most people with addictions do not have fully 
fledged ASPD, and most people with ASPD do 
not have addictions. 

Indeed, many people with substance depend- 
ency do not have abnormal levels of antisocial 
traits at all. However, because breaking the law 
is itself a diagnostic symptom for antisocial 
behaviour, this trait will automatically be asso- 
ciated with illegal drug addiction, even if the 
only laws that are violated are drug laws. 

Moreover, being extremely sensitive and 
overly cautious — essentially the opposite of 
a callous, impulsive criminal — also raises the 
risk of addiction, although not by as much. 
This suggests both that the stereotype of the 
addictive personality badly mischaracterizes 
many people who have a substance-use disor- 
der, and that the genetic risk associated with 
ASPD does not account for most addictions. 

Koob points out that addiction research, like 
the rest of psychiatry, is increasingly focus- 
ing on the genetics that underlie symptoms, 
such as poor impulse control, rather than 
on syndromes such as alcoholism or ASPD. 
“There are specific types of symptoms that have 
underlying neurobiological bases,” he says. 

These temperamental or physiological 
predispositions can potentially develop into 
many different disorders. For example, impul- 
sivity could lead to a range of problems: it is 
a characteristic of addiction, ASPD, bipolar 
disorder, borderline personality disorder 
and many more. Impulsive behaviour also 
increases the risk that teenagers will try drugs, 
and makes it harder for them to resist the 
urge when they want to stop. 

By contrast, anxiety can drive addiction 
in a different way: people who feel anxious 
may take drugs to cope with social fears, and 
their difficulty stopping is not through a lack 
of control, but because of a lack of alternative 
ways to manage their emotions. This means 
that programmes must be tailored to individ- 
ual needs, not based on the idea that all people 
with addictions are the same. 


PHYSIOLOGICAL FACTORS 

Personality is not the only way in which genes 
can influence addiction risk. The strongest and 
most replicated genetic risk factors for alcohol- 
ism involve genes linked to metabolism. These 
genes encode proteins that convert alcohol into 
acetaldehyde, and acetaldehyde into acetate. 
Acetaldehyde is particularly toxic, and genes 
that cause it to build up in the blood, such as 
a variant of ALDH2, make even light drink- 
ing unpleasant. “When it is floating in their 
system, people don’t like it,” says Rutter. “They 
get really hot or feel nauseous.” Hangovers 
and the anti-alcohol drug disulfiram produce 
pretty much the same effect. 

Genes that lead to slow alcohol metabo- 
lism are common in the Asian population. A 
2006 meta-analysis of 15 studies included 
4,500 Chinese, Japanese, Korean and Thai 
participants who were tested for genes related 
to the metabolism of acetaldehyde and acetate. 
The largest protective factor was the ALDH2 
variant, which makes people nine times less 
likely to develop alcoholism than those with 
other variants of the gene. 

But even a gene that provides this much pro- 
tection can be overridden by environmental 
pressures. Between 1979 and 1992, for exam- 
ple, the percentage of Japanese people who 
misused alcohol and who had this variant rose 
from 2.5% to 13%, as a heavy-drinking culture 
developed among businessmen that made it 
much harder to refuse to drink. 

One gene that is strongly associated with 
cigarette addiction, CHRNA5, has essentially 
the opposite effect on smoking risk to the one that the 
ALDH2 variant has on alcohol risk. Having just a single 
variant can double the risk of nicotine addiction. 
This link is one of the best supported in 
any disease, not just in addiction, Rutter says. 

Researchers initially thought that the 
CHRNA5 variant, which codes for a subunit 
of the acetylcholine receptor that is affected by 
nicotine, would make nicotine more pleasurable. 
This would explain why people who smoke 
and who have the variant smoke more heavily 
than those without it. But instead, it softens 
nicotine’s initial negative effects. Nearly 
everyone who has ever smoked reports that the 
first time is nauseating at best. “When I tried 
cigarettes when I was a kid, I turned green and 
hated it,” says Rutter. 
But people with the CHRNA5 variant have 
a less unpleasant experience, says Paul Kenny, 
a pharmacologist at New York’s Mount Sinai 
Hospital. “Instead of the drug being more 
rewarding, what happened was that the 
aversive effects were diminished,” he says. 
Investigation of the CHRNA5 gene in knockout 
mice showed that it is active in a brain region 
called the habenula, which is involved in avoid- 
ance and aversion, even though it had not previ- 
ously been strongly linked with addiction. The 
evidence also suggests that heavy smoking may 
damage the habenula by harming the neurons 
that inhibit it. This would create strong negative 
feelings and distress in those who smoke, which 
they may try to fight with even more nicotine. 


ADDICTION AND DEVELOPMENT 

Epigenetic mechanisms, which control the 
activity of genes by switching them on and off, 
are also being seen as increasingly important 
in addictions. Kenny’s lab studies these as well 
and found that one way that addiction can 
epigenetically ‘rewire’ the brain is by turning 
on genes that are normally activated only dur- 
ing brain development. 

For instance, a mutation in the MECP2 gene 
is known to cause Rett syndrome, a develop- 
mental disorder found mainly in girls that 
is associated with intellectual disability and 
autistic symptoms. During fetal and childhood 
development, MECP2 regulates nerve-cell 
growth, and then it is silenced. However, when 
rats are allowed to binge on cocaine, Mecp2 
expression “goes through the roof”, says Kenny. 
Bingeing on cocaine rewires the brain, turning 
on genes that are usually quiescent in adults. 

Other animal experiments have shown that 
switching off the gene in the reward regions 
reduces cocaine intake. This suggests that the 
aberrant learning, which resists the negative 
consequences of addiction, may be especially 
deeply engrained. But the actions of MECP2 
have not been studied in normal emotional 
learning processes in humans that activate 
similar circuitry, such as falling in love, so it is 
not clear whether this is unique to addiction. 
It is also not known whether these genes are 
normally reactivated during adolescent brain 
development, which might help to explain why 
adolescence and early adulthood is the highest- 
risk period for addictions. Because Rett syn- 
drome is profoundly disabling, those affected 
are rarely exposed to drugs, so it is not known 
how the disorder affects addiction risk. 

Kenny thinks that other genes linked to 
developmental disorders may also be impor- 
tant in addictions — and not just in people 
who have these conditions, whose brains are 
wired differently from the start. If addiction 
does reactivate brain-development genes, 
more common variants could be involved. 
“We should be looking for genes that cause 
developmental disorders,” he says. 

Another factor that affects both epigenet- 
ics and addiction risk is childhood trauma. 
Severe stress in early life is known to dramati- 
cally increase the risk of addiction, and the risk 
increases with greater trauma exposure. For 
example, a recent study of the entire Swedish 
population showed that people who as children 
either lost their parents, experienced a parent's 
diagnosis of cancer, or witnessed domestic 
violence had twice the risk of a substance-use 
disorder later in life compared with those who 
did not have such stressful experiences. 

Indeed, some risk genes, such as those linked 
to the serotonin transporter, may not cause 
any problems unless there is a stressful early 
environment. Both chronic stress and addic- 
tion can induce some of the same epigenetic 
changes in stress systems and in those involved 
with pleasure, which may partly explain why 
addiction and trauma are so tightly linked. 
“Early life experience may dictate whether or 
not those genes or variations in those genes 
in those different circumstances tend to come 
into play,” says Rutter. 

Given the increasing evidence of how varied 
addiction is, treatment and prevention pro- 
grammes will need to be significantly updated. 
Some researchers are trying to work out how to 
target prevention to particular temperaments, 
rather than attempt to reach both the anxious 
and the impulsive with the same message. 

“That’s the blessing and the curse,” says 
Rutter. “There are many ways to get there, but 
that also means many ways to intervene.” 


Maia Szalavitz is a science writer based in 
New York City. 
Rewiring the brain 


Neuroscientists are learning how to repair neural circuits damaged by addiction. 


BY KATHERINE BOURZAC 


Neuroscientist Woody Hopf opens 
a cabinet in his alcohol research 
laboratory at the University of San 
Francisco, California. Inside is a cage containing 
a rat that is being taught addictive behaviours. 
The rat has been conditioned to press a lever 
to release a squirt of alcohol when it hears a 
beep. Hopf closes the cabinet so that the rat will 
not be distracted by the sights and sounds of 
human visitors. Just as it takes time for people 
to undergo the characteristic brain changes that 
enforce addiction, he says, it will take time for 
his rat to become dependent on alcohol. 

Researchers such as Hopf view addiction 
as a disease of the brain circuits responsible 
for pleasure, stress and decision-making. 
“Addictive substances come at the brain in different 
ways,” says George Koob, director of the 
US National Institute on Alcohol Abuse and 
Alcoholism (NIAAA) in Bethesda, Maryland. 
“But in the end, they're activating some of the 
same circuitry and patterns of behaviour.” 

For decades, researchers have been mapping 
the electrical and chemical circuits that underlie 
addiction. Now they are working on strategies 
for healing these neural pathways. Imaging stud- 
ies show how the brain rewires during recovery 
from addiction. When combined with studies 
of how the brain develops during adolescence, 
the work could help researchers to understand 
how the brain changes that are characteris- 
tic of addiction occur, as well as who is most 
vulnerable and why. This work is rapidly being 
translated into treatments. By using electrodes 
and fibre-optic cables, researchers can inter- 
vene in neural circuits with great precision, 
causing animals to lose their taste for alcohol 
or their interest in cocaine, not just for days but 
for weeks or months. This work is now being 
tested in people. Researchers hope that therapies 
to heal damaged brain circuits will improve the 
odds of people overcoming addictions. 


CROSSED WIRES 
Koob divides addiction into three stages, 
each with its own brain circuit — groups of 
neurons or larger structures that interact in a 
characteristic way (see page S46). Addiction 
starts with the feel-good binge stage, which is 
fuelled by the brain’s reward circuit, particu- 
larly at the nucleus accumbens. Withdrawal 
brings stress, centred in the emotional amyg- 
dala. Finally, craving and compulsion circuits 
extending from the prefrontal cortex keep 
someone using a drug, regardless of negative 
consequences. Impulsive bingeing leads to hab- 
its as the user needs the drug to feel normal. 
The changes to the brain’s circuitry are 
long-lasting, so people trying to give up will 
often relapse. Even years after recovery, people 
often start using again when some cue, such 
as the smell of alcohol or the site of an 
old hangout, retriggers old patterns. 
But the changes are not permanent. “The brain can enjoy some 
recovery, probably through remodelling to 
override the broken parts,” says Edith Sullivan, 
an experimental psychologist at Stanford 
University in California. 

Some of the physical damage caused by 
alcohol misuse can be undone. For example, 
says Sullivan, the brains of people who have 
misused alcohol for a long period shrink, but 
some of that brain volume can be regained 
by sustained sobriety. There is also some 
functional recovery — even if the pathways are 
not fully restored, the recovering brain starts to 
find workarounds. 

Sullivan’s group has been using functional 
magnetic resonance imaging (fMRI) to study 
cognition in those recovering from alcoholism. 
A cognitive skill the researchers focused on is 
spatial working memory — the thinking that 
helps you to remember where you parked your 
car, for example. Poor spatial working memory 
is characteristic of alcohol misuse. 

Sullivan’s research suggests that people 
recovering from alcohol addiction manage to 
work around brain damage; in other words, 
their brains find ways of accomplishing tasks 
by avoiding using damaged areas and they start 
to regain their working memory. The group 
found that alcohol-dependent people who had 
been sober for at least a month performed as 
well as non-alcohol-dependent controls on spa- 
tial working-memory tasks, but used a different 
part of the brain to do it. Sullivan gave them a 
more abstract task than looking for a lost object 
or a parked car, but like those tasks it required 
visual processing, which can take one of two 
broad neural paths. Patients without brain 
damage typically rely on a ‘where’ pathway to 
do the task, whereas those in recovery from 
alcohol dependence activate a ‘what’ pathway, 
which tends to be used for recognizing and 
identifying what we see. 

“The next step is to find out how to train a 
person with brain damage to use these new 
pathways,” says Sullivan. Encouraging the 
natural recovery process could help people 
who are dependent on alcohol to make faster 
progress. Sullivan compares the brain damage 
from alcohol addiction to that caused by stroke. 
“Recovery won’t take three days, it may take 
three or six months, or a year,” she says. It takes 
time for changes to occur in the brain when 
someone develops a dependence on alcohol, 
and it takes time to undo that. 

Sullivan is currently investigating whether 
there is a cost to this rewiring. She suspects 
that people in recovery are performing the 
cognitive steps needed for these tasks sequen- 
tially, so they take longer than people without 
addictions who do the steps rapidly in parallel. 
The damaged brain has fewer circuits to use, so 
the brain finds it harder to multitask. 


EARLY START 

Our understanding of the addicted brain 
comes from animal studies and from research 
on people who are already addicted or are 
in recovery, such as Sullivan's participants. 
Researchers can only guess at how these 
changes develop in people. Henning Tiemeier, 
a psychiatric epidemiologist at Erasmus 
Medical Center in Rotterdam, the Neth- 
erlands, says that the only way to see these 
changes is to follow people over time. “There 
is a lot of debate about how harmful substance 
abuse is for brain development, and you cannot 
prove it with one brain image,” he says. 

Two studies, one planned in the United States 
and one already underway in the Netherlands, 
could provide some answers. Both will follow 
adolescents. The adult brain is already formed, 
although it is still plastic, which is why alcohol- 
ism and drug addiction become so engrained, 
and why the resulting damage cannot be fully 
repaired. The worry, says Koob, is that the 
developing brain may not form properly under 
the influence of drugs and alcohol. Children 
do not have the cognitive skills to make good 
choices, making them particularly vulnerable. 
“Young people have a well-developed reward 
system but they don’t have a good executive 
control centre,” says Koob. The key part of that 
centre, in the brain’s prefrontal cortex, does not 
finish developing until about the age of 25. 

The US National Institutes of Health (NIH), 
a federal agency that includes the NIAAA 
and the National Institute on Drug Abuse 
(NIDA), is currently accepting proposals for 
the Adolescent Brain Cognitive Development 
study, which will enrol 10,000 children 
aged 10 and follow them into adulthood, using 
neuropsychological tests, brain imaging and 
surveys, focused specifically on addiction. 

A woman receives transcranial magnetic 
stimulation, a non-invasive therapy that is being 
used in Italy to treat cocaine addiction. 

Tiemeier is working on the Generation R 
study in the Netherlands, which has a broader 
focus on fetal and childhood development 
and has been following 10,000 children from 
before birth. The youngest are now aged 9, and 
the oldest are 12, a stage when some will begin 
experimenting with cigarettes and alcohol. 

Generation R is collecting the first set of brain 
MRI scans from children in the study, and has 
about 3,300 so far. By continuing to collect them 
as the children grow, changes over time will 
become clear. This is by far the largest brain- 
imaging study on adolescents in the world, says 
Tiemeier, so it should provide evidence about 
how substance use affects the developing organ. 
He does not expect to see major developmental 
changes associated with the occasional sub- 
stance use likely to be found in Generation R 
because it is a general population study, rather 
than being focused on people who are addicted 
to a substance. For this reason, such studies need 
to be as large as possible if they are to find out 
what damage drug use does, and how it interacts 
with puberty, when surges of hormones affect 
behaviour and brain development. 

More information will be available when the 
Generation R data are combined with results 
from the NIH study, says Nora Volkow, direc- 
tor of NIDA. These studies will provide a better 
understanding of the brain changes that reflect 
what she calls “the skeleton of addictive behav- 
iours”. Addiction to cigarettes is different from 
addiction to heroin, for example, but all addictions 
have a common neurological framework. 
These studies will show how it grows. They 
should also yield insight into who is vulnerable 
and why, and how they might be helped sooner. 

But as further research deepens our understanding 
of addiction as a disease characterized by changes 
in the brain, researchers and 
policymakers need to think about better ways 
to evaluate medications and therapies, says 
Volkow. Currently, any pharmaceutical treat- 
ment for addiction needs to show that the 
patient is now completely free of their addic- 
tion, which is difficult to prove and takes a 
long time (see page S53). “Rather than ask for 
an outcome of complete abstinence, shouldn't 
we evaluate these treatments on their ability to 
counteract these brain changes?” she asks. 


PAINFUL REALITIES 

This focus on reversing changes to the addicted 
brain is leading to therapy ideas that are show- 
ing promising early results in animals. Hopf’s 
rat studies, for example, have led to a potential 
therapy for alcoholism that is focused on coun- 
tering the compulsion to use despite negative 
consequences such as the loss of relationships 
with family and friends, employment or health. 
Because rats do not fear these outcomes, Hopf 
uses simpler analogues. In some experiments, 
alcohol-dependent rats are given extremely 
bitter alcohol instead of the expected normal 
flavour, or in the lever-pressing test they occa- 
sionally receive a painful electric shock to their 
paw. “The rats want the alcohol but they are not 
happy about it,” Hopf says. 

After years of painstaking research and some 
luck, Hopf found that a particular group of neu- 
rons in the reward-centred nucleus accumbens 
has a key role in promoting compulsive drinking. 
This year, he found that an approved drug 
called D-serine binds to receptors on these neurons, 
causing them to fire less often, leading the 
alcohol-dependent rats to drink less. It seems 
to work by disabling the compulsive behaviour, 
by turning off the power to deny painful 
realities. Rats that experience bitter or painful 
consequences drink less 
when given the drug. Rats that have no such negative 
consequences to fear are not affected by the 
drug and drink as normal. 

The nucleus accumbens and a denial of the 
reality of the situation are involved in multiple 
stages of addiction, according to Koob, and have 
a role in both intoxication and the withdrawal 
process. Hopf is now writing up a plan for a 
clinical trial of D-serine. 

Other techniques target addiction circuits by 
using physical interventions, rather than drugs. 
Researchers at the University of Geneva in Switzerland 
led by neurologist Christian Lüscher 
have used a method called optogenetics to target 
a particular group of cells and receptors involved 
in cocaine addiction in mice. Optogenetics 
allows researchers to turn off gene expression 
precisely by shining light into the brain through 
implanted optical fibres. When Lüscher’s group 
used the method to calm a group of overactive 
dopamine-receptor neurons in the nucleus 
accumbens, the mice stopped seeking cocaine. 

METHAMPHETAMINE MISUSE 

Brain scans of someone who has never tried the stimulant methamphetamine (left) and of a user (right). Use 
of the illegal drug affects dopamine transporters in the brain. Eventually, users need more drug to achieve the 
desired effect. Dopamine is a neurotransmitter that plays a key part in the brain’s pleasure pathways. 

However, optogenetics cannot be used to 
treat people. The method first requires genetic 
engineering to render the target cells sensitive to 
light, and it is not yet possible to safely implant 
optical fibres in the human brain. 


STIMULATING RECOVERY 

Instead, Lüscher’s team is attempting to emulate 
the effects of optogenetics by using methods 
that translate better to the clinic. They are 
developing a variation on deep-brain stimu- 
lation (DBS), a technique that uses an electric 
current to silence overactive neurons, which 
is commonly used to treat movement disor- 
ders such as Parkinson's disease. By careful 
placement of the electrodes, clinicians can 
target DBS to a particular region in the brain. 
Researchers have tried using it to treat addic- 
tion in people, but results have been mixed. 
Lüscher is combining DBS with drugs to block 
particular receptors in the rat brain, making it 
possible to silence specific cell types. First they 
implant an electrode in the nucleus accumbens. 
Then they use a drug that blocks the neurons’ 
dopamine receptors. Finally, they switch on the 
electrode for ten minutes. The effects of DBS 
for treating Parkinson's are transient: when the 
electric field is turned off, the tremor returns. 
But Lüscher’s combined therapy had a longer-lasting 
effect. After 10 minutes of stimulation, 
the rats exhibited normal behaviour for the following 
21 days. Lüscher thinks this means that 
the treatment may be repairing part of the circuit 
that was damaged by addiction. He says that the 
group’s next step will be to test this approach in 
primates, or possibly take it to clinical trials. 

This demonstration of an apparently long-term 
reversal of drug-related behaviour is “a 
miracle”, says Jessica Wilden, a neurosurgeon 
at the Louisiana State University Health Center 
in Shreveport. Could this lead to a therapy in 
which you give a patient a pill and a day of brain 
stimulation and then they are drug free? “In a 
small way that’s what they’re showing,” she says. 
But doing it in people will be harder, she warns. 

Wilden is investigating whether DBS can 
be used to treat methamphetamine (meth) 
addiction. Meth affects dopamine receptors (see 
‘Methamphetamine misuse’) and is a growing 
problem, particularly in Iran and in the southern 
United States, often for military veterans. Unlike 
other drugs, which tend to be misused mostly by 
men, meth use is equally common in women, 
and has a burden on children because women […] 
says Wilden. It is a huge 
challenge. The drug is a 
potent stimulant, with 
effects lasting for 16-20 hours in the rats; the 
animals become agitated and stressed, and get 
tangled up in the equipment used to administer 
the drug and the cables that connect them to the 
DBS system. 

Although DBS is a helpful research tool, 
Wilden and Lüscher both doubt whether it 
can be widely used to treat addiction — and 
Wilden’s work with meth illustrates the diffi- 
culties. The therapy is expensive, invasive and 
requires patients to care for the implants and 
to return to the clinic for regular follow-ups. 
Those motivated to overcome alcoholism might 
be able to do it. But people with more destruc- 
tive addictions, particularly to meth, are less 
cooperative and have high rates of homeless- 
ness, making the treatment even less suitable. 
“The deep-brain stimulator is a pacemaker, with 
wires going under the skin into the chest where 
they connect to a battery,” says Wilden. “That’s a 
lot of metal, especially in people who are fragile. 
There's no way I can implant this in someone 
living on the streets.” 

Lüscher and Wilden plan to validate their 
interventions with optogenetics and DBS in 
animals, and then adapt the results to clinically 
realistic techniques. The most likely candidate is 
transcranial magnetic stimulation (TMS), which 
uses a magnetic field to stimulate electrical activ- 
ity in neurons deep in the brain. One advantage 
is that TMS is non-invasive: treatment simply 
involves wearing a magnetic helmet for a few 
minutes. It is currently used to treat depression 
and migraines. 

So TMS is more patient friendly, but it is also 
more mysterious — researchers do not know 
why it works. Furthermore, it has poor spatial 
precision, which frustrates neuroscientists who 
want to target specific brain locations. But this 
might not matter, says Antonello Bonci, a clini- 
cal neurologist and scientific director at NIDA. 

In 2013, Bonci published a paper describing 
how his team had used optogenetics to reac- 
tivate an area of the prefrontal cortex that was 
abnormally quiet in cocaine-addicted rats. The 
treated rats lost interest in pressing a lever to get 
cocaine. A few months later, Luigi Gallimberti 
and Alberto Terraneo at the University of Padova 
in Italy started using TMS to target the equiva- 
lent area in the brains of people addicted to 
cocaine. They have since been successfully using 
the technique to treat such people. 

Bonci says that the results are anecdotal, but 
exciting: most people who stuck with the treat- 
ment for a few weeks have now been clean for 
several months, and testify that they do not even 
think about cocaine any more, he says. With this 
black cloud lifted, they are able to enjoy food, 
sex, reading, family time and all the other good 
things in life. Bonci is now working with the 
Italian group to design a double-blind clinical 
trial, and is collaborating with another group to 
work out how the TMS works. “It’s up to us now 
to figure out who’s getting better and why, and 
how many sessions it takes,” he says. 

In addition to TMS, the Italian patients 
also received supportive medical care and 
psychological therapy. Even with brain 
stimulation or medication, people still need 
emotional support, as well as therapy “to 
identify triggering cues and memories, and 
practise making new grooves of thought”, says 
Hopf. But with tools such as DBS and TMS, 
neuroscientists’ deepening understanding of the 
circuitry of addiction is now being translated to 
the clinic much more rapidly than ever before. 

“For the first time in the history of 
neuroscience, we can think about translating 
basic science to the clinic in months, as opposed 
to the 15 years it can take for drug development,” 
says Bonci. Thanks to the new technologies, he 
says, “we’re close to a treatment”. 


Katherine Bourzac is a science journalist 
based in San Francisco, California. 

PHARMACOTHERAPY 


Quest for the quitting pill 


Addiction researchers are optimistic that they can create effective medications to treat 
addictions. But the key question is, will pharmaceutical companies bring them to market? 


BY CASSANDRA WILLYARD 


men purchased a farmhouse in Center 
City, Minnesota, and helped to turn the 
rambling wood structure into a sanatorium for 
professionals who had become dependent on 
alcohol. The facility, called Hazelden, spawned 
one of the largest drug and alcohol addiction 
treatment networks in the United States, with 
16 centres in 9 states. 

Since its inception, abstinence has formed 
the backbone of Hazelden’s approach to recov- 
ery. But in 2009, Marvin Seppala, the institu- 
tion’s chief medical officer, began pushing for 
the network to use medication to treat opioid 
addiction. For the past 20 years the United 
States has been in the midst of an opioid- 
addiction epidemic, and as the number of 
Hazelden residents receiving treatment for 
opioid dependence grew, Seppala noticed a 
few disturbing trends. More people seemed to 
be leaving their programme before completing 
their course of treatment, or continuing to use 
drugs while at Hazelden. Seppala returned to 
Hazelden in 2009 after two years working in 
private practice. He had seen the effective- 
ness of drugs such as Suboxone (buprenor- 
phine and naloxone), an opiate substitute 
manufactured by Reckitt Benckiser Pharma- 
ceuticals that helps to reduce cravings, and 
Vivitrol (naltrexone), a long-lasting injectable 
medication manufactured by Alkermes that 
blocks the effects of heroin and other opiates. 
Seppala thought that these medicines might 
be able to address some of the problems that 
Hazelden’s patients were having adhering to 
their programmes. 

The move was controversial. For decades, 
Hazelden had helped people with addictions 
to recover by promoting abstinence and a 
belief in the power of the 12-step programme, 
as used by Alcoholics Anonymous. “The use 
of a maintenance medication like Suboxone 
wasn’t necessarily seen as appropriate,” Seppala 
says. So in 2012, he began holding forums 
with Hazelden staff to educate them about his 
vision. “We thought they were going to throw 
tomatoes and rotten eggs,” he says. But there 
was surprisingly little resistance. Too many 
of the clinicians had seen former Hazelden 
residents relapse and die of a drug overdose. 
In 2013, the centre began offering patients 
Suboxone and Vivitrol as well as group 
counselling for opioid dependence. Although 
the number of people involved in the new 
programme is still small, Seppala has seen 
some encouraging signs. At Hazelden, the 
typical dropout rate for people receiving 
treatment for opioid addiction is 22%, he says. 
But among those with opiate dependency 
enrolled in the new programme, the drop- 
out rate was just 5% in 2013 and 2014. Six of 
Hazelden’s patients relapsed and died of opi- 
oid overdoses in 2013, but none of them were 
in the new programme that offers both medi- 
cation and counselling. “You can’t say there's 
a direct correlation,” Seppala says. “However, 
when it’s six to nothing, you’ve got to say that 
there’s a dramatic shift, and that we’re doing 
something correct.” 

Hazelden’s adoption of opioid-addiction 
medication is a sign of a much larger societal 
shift — a growing recognition that addiction 
is a complex chronic disease that, like other 
neurological disorders, often responds to 
prescription drugs. 

But there is nothing on the market for 
people who are addicted to cocaine or 
methamphetamines. Only a handful of drugs 
currently exist to treat nicotine, alcohol and 
opiate addiction, and those medicines do 
not always work. “All the addiction medica- 
tions that are on the market, at best they’re 
successful 30% to 35% of the time,” says Stan- 
ley Glick, an addiction researcher at Albany 
Medical College in New York. “We don't only 
need new medications, we need better medica- 
tions.” Glick and others are exploring a variety 
of promising targets, and they are optimistic 
that they can create the next generation of anti- 
addiction drugs (see ‘Drugs against drugs’). 
But some addiction researchers question 
whether pharmaceutical companies, which 
have shied away from addiction therapies in 
the past, will be willing to bring the advanced 
therapies to market. 


TARGET PRACTICE 

Addictive drugs wreak havoc on the brain’s 
reward circuitry. Some, including heroin, 
mimic natural neurotransmitters. Others, 
like cocaine, bind to receptors and prompt 
the brain to release its own. But the end result 
is the same: a brain awash in dopamine, the 
chemical responsible for pleasure. That overlap 
in the molecular pathways means that it may 
be possible to develop treatments that target 
multiple addictions. “What we're interested in 
is molecular mechanisms that may transcend 
a particular addictive drug,” says Phil Skolnick, 
director of the division of pharmacotherapies 
and medical consequences of drug abuse at the 
US National Institute on Drug Abuse (NIDA) 
in Bethesda, Maryland. That is important, he 
says, because “many people who abuse drugs 
don’t abuse just one drug”. 

Glick thinks that he may have found one 
such compound. In the late 1980s, Glick 
received a call from Howard Lotsof, who was 
formerly addicted to heroin. Lotsof claimed 
that he had discovered a cure for opiate addic- 
tion. He told Glick about a psychoactive com- 
pound called ibogaine that occurs in several 
plant species, including the West African 
Tabernanthe iboga shrub. Lotsof had already 
approached a number of scientists with his 
cure. “For better or for worse, I was the first 
one that was fool enough to become interested 
in it,” Glick says. Glick imagined that he would 
be able to give the drug to a few morphine- 
addicted rats and quickly debunk Lotsof’s 
claims. But to Glick’s surprise, ibogaine 
worked. “So we started to get more interested 
in it,” Glick says. Ultimately, it turned out that 
the drug has some significant drawbacks. 
It can slow the heart and, at high doses, can 
damage the nervous system. “There was no 
way ibogaine was ever going to be an approvable 
drug in the United States,” Glick says. 

Lobelia inflata is a source of lobeline, which may 
help to curb the rush from methamphetamine use. 

So Glick partnered with a medicinal 
chemist and began searching for a new drug, 
something that would produce the same 
response as ibogaine, but without all the toxic 
side effects. The pair landed on a compound 
called 18-MC. “It doesn’t work like any other 
medication that’s ever been proposed to treat 
addiction,” Glick says. 

Although some addiction therapies work 
directly on the circuitry that shuttles dopa- 
mine through the brain, the pathway that 
seems to play a crucial role in most forms of 
addiction, 18-MC works indirectly. It binds 
to a nicotinic receptor called α3β4, which 
is concentrated primarily in the middle of the 
brain. These receptors are not part of the dopamine 
pathway, but Glick’s research suggests 
that by blocking the α3β4 receptors, 18-MC 
dampens the dopamine pathway’s euphoric 
response to drugs. Glick and his colleagues 
have found that 18-MC works in all kinds of 
addiction models, curbing animals’ use of 
cocaine, methamphetamines, morphine, alco- 
hol and nicotine. “It opens the door for a whole 
new approach for affecting the reward system 
and for reducing addictive behaviour,” he says. 

Lotsof, who spent much of his life pushing 
for an anti-addiction therapy, died of cancer in 
2010. But Glick kept working to make Lotsof’s 
dream a reality. The same year that Lotsof died, 
Glick began to work with a biotechnology 
company called Savant HWP, headquartered 
in the San Francisco Bay Area, California, to 
help develop 18-MC further. The first human 
study began in Brazil in July 2014, led by 
Savant’s South American partner, Brazil-based 
Hebron Farmaceutica, which is developing 
18-MC for a different condition: the parasitic 
disease leishmaniasis. The collaboration 
makes good financial sense, Glick says. Both 
companies need to demonstrate that the com- 
pound is safe before they can move forward, 
and phase I studies, which assess safety in dis- 
ease-free participants, are similar regardless of 
the intended use. 

The results have yet to be published, but 
Steven Hurst, Savant’s CEO, says that so far, the 
compound seems to be safe. The next study, 
slated to begin this year, will start to gather 
data on whether 18-MC can help people who 
smoke to break their nicotine habit. 

Savant’s researchers are not the only ones 
pursuing the α3β4 receptor as a target for 
addiction medications. Nurulain Zaveri, a 
medicinal chemist, was already hunting for 
medicines to curb nicotine addiction when she 
learned about Glick’s findings in 2003. She was 
intrigued by the prospect that the largely over- 
looked receptor could be a good target for nic- 
otine dependence. But she noticed that Glick’s 
compound hit a variety of different targets, not 
just α3β4. Zaveri wanted something more 
selective, so she began screening compounds. 
In 2007, she found one that seemed to be not 
only selective but also potent — a chemical 
called AT-1001. 

In 2008, Zaveri founded a company called 
Astraea, headquartered in Mountain View, 
California, to develop AT-1001 and simi- 
lar compounds as therapies to help people 
stop smoking. In 2012, her team showed 
that AT-1001 can block self-administration 
of nicotine in rats, and in June 2015, they 
reported that the compound may also prove 
valuable for treating alcohol addiction following 
studies on rats. Zaveri says she also 
has data to suggest that AT-1001 might help 
to stop cocaine dependence. Her other lead- 
ing compounds seem to show similar effects, 
and Zaveri is currently trying to decide which 
compound to move into clinical trials. 

Linda Dwoskin, an addiction researcher 
at the University of Kentucky in Lexington, 
is working on a different compound, lobeline, 
which comes from Lobelia inflata, a plant commonly known as Indian 
tobacco. 

Lobeline binds to nicotinic receptors that 
are involved in nicotine addiction — others 
were already investigating it as a potential 
smoking-cessation tool. But Dwoskin dis- 
covered that the compound also binds to a 
protein in the brain called VMAT2, a trans- 
porter that carries neurotransmitters such as 
dopamine and serotonin. VMAT2 is also the 
target for methamphetamines, but lobeline 
did not seem to produce the drug’s pleasurable 
effects. Dwoskin realized that it might be possible 
to use lobeline to block VMAT2, thereby 
preventing the addictive rush associated with 
methamphetamine use. 

Dwoskin launched Yaupon Therapeutics 
in 2002 and took the compound from the lab 
to clinical trials. But when people addicted to 
methamphetamines began taking the lobeline 
tablets, she immediately realized there was a 
problem. The drug tasted terrible, and many 
of the participants developed nausea — not 
that surprising, because physicians used to 
prescribe L. inflata to induce vomiting, earning 
it the nickname ‘puke weed’. “It was a minor 
untoward effect, but enough that compliance 
to the trial was probably going to be an issue,” 
Dwoskin says. “We decided we could probably 
do something better.” 

So Dwoskin went back to the drawing board 
and began working on compounds that would 
specifically target VMAT2. Over the past dec- 
ade, she and her colleagues have developed 
several generations of VMAT2-targeting com- 
pounds. “The ones that we're looking at now 
are extremely exciting,” she says. They stop 
animals from self-administering the drug, and 
even seem to prevent drug-seeking behaviour. 
“I’ve never seen anything like that before,” she 
says. Dwoskin will need funding to continue 
developing the drug in preparation for a human 
trial. “I feel a need to see this to completion 
because it looks so promising,” she says. 


A SHOT IN THE DARK 

Although many researchers have been focused 
on developing drug treatments, others have 
been trying to develop vaccines to curb addic- 
tion. The goal is to induce an immune response 
against addictive substances such as cocaine or 
nicotine. Then, when the vaccinated individual 
takes the drug, natural antibodies would pre- 
vent the drug's active ingredient from reaching 
its target in the brain. Without a pleasurable 
rush, people might be less prone to relapse. 
Kim Janda, a chemist at the Scripps Research 
Institute in La Jolla, California, began working 
on a vaccine in the 1980s. Over the past three 
decades, he has worked on vaccines against 
nearly every type of addictive compound: 
methamphetamines, cocaine, heroin, nicotine, 
tetrahydrocannabinol (or THC, the active 
compound in marijuana) and rohypnol. Each 
one required a different approach. 

Of these, Janda thinks that his vaccine 
against heroin holds the most promise. It 
combines a heroin-like molecule with a carrier 
protein designed to elicit an immune response. 
Heroin breaks down quickly in the body into a 
compound called 6-acetylmorphine and then 
into morphine. Janda’s vaccine is designed to 
mop up all three components, keeping them 
out of the brain and preventing the rush that 
heroin typically provides. In 2013, Janda and 
his colleagues reported that the vaccine seems 
to prevent both drug-seeking behaviour and 
relapse in a rat model. In the most challenging 
experiment, researchers forced rats that had 
become addicted to heroin to abstain for 
30 days. When they gave the rodents free 
access again, rats that had received a sham vaccine 
quickly ramped up their use of the drug, a 
behaviour that in humans often leads to overdose 
because the body has lost its tolerance. 
Vaccinated rats resumed taking the drug, but 
their consumption did not escalate. 

A variety of promising pharmaceuticals are currently being developed to treat addiction. But it will be years 
before any of them join the small number that are already on the market. 

Therapy    Status                    Developer              Indication 
18-MC      Phase I clinical trial    Savant HWP             Nicotine dependence 
AT-1001    Animal studies            Astraea Therapeutics   Nicotine dependence 
GZ-793A    Animal studies            Linda Dwoskin          Methamphetamine dependence 
HeroVax    Animal studies            Kim Janda              Heroin dependence 
TV-1380    Phase II clinical trial   Teva Pharmaceuticals   Cocaine dependence 

Janda has since tweaked the vaccine and 
method of injection, and this second vaccine 
seems to be more effective. But finding some- 
one to help him move to clinical testing might 
prove difficult. Clinical trials are enormously 
expensive, and so far Janda has not had much 
interest from investors or the pharmaceuti- 
cal industry. He thinks that some companies 
might also be turned off by previous vaccine 
failures. A vaccine for nicotine reached a 
phase III clinical trial in 2009, but ultimately 
flopped. There is a mentality of “well, you guys 
had your chance, and it didn’t work”, he says. 


PROFIT MOTIVE 

Any compound that makes it into clinical trials 
risks failure because of unexpected side effects 
or because it does not work as well as hoped. 
But some addiction researchers are worried 
that their experimental therapies will fail for 
a different reason: lack of interest. 

The pharmaceutical industry tends to shun 
addiction therapies because they are viewed 
as unprofitable, Janda says. “Pharmaceutical 
companies don’t view drug addicts as good 
investments.” But, according to Skolnick, that 
perception is wrong. In 2012, before Subox- 
one went off-patent, sales topped US$1.5 
billion. The drug outsold blockbusters like 
Pfizer’s impotence pill, Viagra (sildenafil). 
“Those numbers have made it a more interesting 
game,” Skolnick says. And he thinks that 
today more companies are willing to take the 
risk. For example, NIDA recently partnered 
with Teva Pharmaceuticals, an Israel-based 
company with the ability to both manufac- 
ture and sell medicines, to test the efficacy of 
a compound called TV-1380 to curb cocaine 
addiction. Teva “isn’t one of these little biotech 
companies where once you do a trial, they look 
for a partner,” he says. “They understood that 
you can do good and do well at the same time.” 

Skolnick thinks there may be an even more 
serious barrier keeping drug companies at 
bay. For addiction therapies, the US Food and 
Drug Administration views abstinence as the 
gold standard for approval. That is, the agency 
wants to see a higher rate of abstinence in the 
treatment group than the placebo group. So 
even a medication that helps people to use less 
of a drug might not gain regulatory approval. 

“That seems to be a very, very high bar to jump 
over,” Skolnick says. “I think that that puts some 
drug companies off.” Skolnick does not think 
that such a stringent outcome makes much sense. 
“It would seem intuitively obvious, especially 
for an illegal drug, that if you use it less frequently 
it would have some benefit.” 

To doctors such as Seppala, who witness 
the aftermath of drug addiction on a daily 
basis, the need for new medicines seems 
obvious. “This is a remarkably complex illness,”
Seppala says. “Recovery rates are similar
to other chronic illnesses, but we don’t feel
that’s adequate. Better treatments are neces- 
sary.” But Seppala, who has struggled with 
addiction himself, cautions that even the 
best medicines will not be a panacea. “People 
with addiction have often destroyed relationships,
done things they don’t even want to
admit to anyone,” he says. “If you just give a
medication, you’re basically saying it’s only a
biological illness and ignoring the rest of this 
problem.” That is why Hazelden combines 
a 12-step programme with medication and 
therapy. To overcome the epidemic of opiate 
addiction, “we have to use everything at our 
disposal,” Seppala says. “We can’t rely on a
single approach.”




Cassandra Willyard is a science writer based 
in Madison, Wisconsin.


Beyond the neural circuits


To treat addiction, people need help to develop psychosocial skills in
addition to taking medication, says Kenneth E. Leonard.


We have known many of the fundamental aspects of addiction for some
time. Individuals with addictions gradually give up other pleasures in
favour of alcohol or other drugs; they experience cognitive
difficulties and find it hard to stop thinking about their substance,
and they often fail to control its intake or to cope without it.
Remarkable progress has been made in identifying neurological factors
and changes associated with these behaviours; this in turn has led to
pharmaceutical treatments. However, although neurological aspects are
crucial, they are not sufficient to explain, prevent or ameliorate
addiction.

The development of an addiction is a continuous process, beginning
with occasional substance use followed by habitual heavy use and
culminating with seemingly incessant or episodically excessive use.
Throughout this process, there are neurophysiological changes that
occur. After a substantial period of frequent, heavy and patterned
use, the term addiction is applied. The concept of disease,
representing the underlying neurophysiological changes that occur, is
commonly invoked to explain the addictive behaviours. But addressing
substance use also requires that we recognize the social and
psychological processes that underlie the development of addiction.
These processes remain influential even as the circuitry of the brain
adapts to the escalating use of a drug. In this sense, addiction is
more than a disease and involves more than the brain: it is a systemic
behavioural disorder arising from and maintained by psychological,
social and biological processes operating both independently and in
concert.

In most cases, the development of addiction emerges from an acquired
pattern of substance use that serves some motivational function:
enhanced positive feelings or states, and reduced unpleasant feelings
or states. Individuals who expect that a substance can improve these
moods are more inclined to use that substance. Whether someone
progresses to addiction depends in part on how well that substance
meets those expectations. Over time, sustained excessive substance
use can increase negative emotions and reduce the impact of positive
ones through the neurophysiological changes to the brain, potentially
magnifying the importance of the substance to the individual.
But it is important to recognize that increased negative emotions and
decreased positive emotions also arise through the impact of use on
the individual's cognitive capabilities and the social environment. And
that progression into addiction can be impeded by the availability of
options other than drugs for enhancing positive states and for coping
with negative states.

Furthermore, when it comes to quitting, changes in brain function
that have been caused by an addiction do not always lead to cravings,
loss of control or relapse. Individuals are able to adjust their intake
of a substance or abstain altogether in response to environmental
rewards. For example, studies have shown that providing prizes or
vouchers for goods or services can increase abstinence from cocaine,
tobacco, alcohol and opiates. Moreover, relapse is influenced by
intra-personal processes, including self-efficacy, expectancies about
the effects of alcohol, negative emotions and coping abilities. These
psychosocial problems remain, even if the underlying neurophysiology
of the brain can be rectified.


A NETWORK OF SUPPORT 
An essential element in addressing these issues for each addicted 
individual is to identify and improve their relationships with friends 
and family, because these connections directly and indirectly affect 
both the addictive process and recovery. Addiction makes its mark
on a person’s social life: he or she may lose friends, alienate family and
develop a new social network that supports their habits. The social 
environment has a significant effect on substance use. Research 
has demonstrated that marriage and a satisfying marital relation- 
ship reduce relapse among men with alcohol 
dependence and that incorporating relation- 
ship counselling into treatment for alcoholism 
leads to improved outcomes, such as abstinence.
Conversely, a critical partner, anger and psychological
aggression are associated with poorer
outcomes such as relapse. The impact of interpersonal
relationships also extends to adult peers
and the broader social network, both in terms of 
general support as well as specific support for 
abstinence. A network of supportive friends and
family is a key element of recovery.
Relieving and perhaps reversing the 
neurophysiological aspects of addiction with 
medication is an important element of treat- 
ment. However, advances in medication for 
treatment will not, on their own, cure addiction. 
Individuals must also develop the skills and 
resources to cope with the inevitable negative and stressful experi- 
ences of everyday life and to experience meaning and pleasure in social 
and intimate relationships without the use of the addictive substance. 
Although these coping processes are represented in the brain, they 
are most effectively addressed through treatments such as cognitive 
therapy or training to improve coping skills that focus directly on the 
psychosocial aspects of addiction. The development and testing of psy- 
chosocial strategies to develop these skills and resources must remain 
an essential aim of research to help prevent and treat addiction.


Kenneth E. Leonard is the director of the Research Institute on 
Addictions, University at Buffalo, New York. 
kleonard@ria.buffalo.edu


CONTINGENCY MANAGEMENT


Why it pays to quit


Giving a gift or a cash incentive to someone to give up an addiction sounds like a prize for
behaving badly, but the practice works. The real challenge is deciding who should pay for it. 


BY SUJATA GUPTA 


Anna smokes almost two packs of
cigarettes per day. Her brand of choice
is Marlboro, full flavour, no menthol.
Anna is 14 weeks pregnant, so that is what her 
fetus smokes, too. 

Aged 33 with short, curly blondish-red 
hair, and a hot-pink iPhone dangling from the 
pocket of a hooded sweatshirt, Anna (not her 
real name) is at the University of Vermont's 
Center on Behavior and Health in Burlington to 
learn about a smoking-cessation study for preg- 
nant women. She knows that smoking is bad for 
her baby. She also knows that she cannot afford 
the habit, which costs her more than US$7,000 
each year — approximately half of what she 
earns as an in-home childcare provider. 

At the laboratory, Anna learns that she will 
be randomly assigned to one of two treatment
groups. Treatment A will involve three brief
educational sessions and nine phone calls 
from a trained smoking-cessation coach. At 
each visit, including the one today, she will 
earn $50; completing all 9 calls will earn her 
an additional $65. She will also receive money 
for having ultrasound scans taken of her baby 
at 30 and 34 weeks and for participating in 
8 additional assessments in the year following 
the birth of her baby. Participants receiving 
treatment A can be compensated by up to 
$655 for their time. 

In treatment B, the ‘incentives for absti- 
nence’ condition, the protocol for partici- 
pants is the same as for those in treatment 
A, but they are also screened for drug use
through frequent breath and urine tests at the
clinic, and are rewarded when their results
are negative. The women earn gift cards that
can be used to pay for things such as baby
clothes, films, fuel for their car and groceries.
In addition to the $655, participants in treatment
B can earn $1,200–$2,400 in incentives.
“You guys definitely got the right idea, throw
money at people,” says Anna, who is hoping
that she will be assigned to treatment B.

A film on a new treatment for addiction is at:
go.nature.com/elgqkk

Giving tangible rewards to reinforce 
positive behaviours, such as abstaining from 
drugs, underlies a field known as contingency 
management. Studies have shown that
rewards that are contingent on good behaviour 
help people to refrain from addictive habits, 
including taking drugs or eating unhealthy 
food. The rationale is that financial incentives 
activate the same reward systems in the brain 
as addictive behaviours. 

Moreover, even though quitting an addiction
improves a person's life in the long run,
research has demonstrated that people are
biased towards immediate gratification. In
other words, receiving an instantaneous cash
reward is more desirable than saving the money
that would have been spent on drugs and
medical care over a long period.

SMOKE FREE
In a contingency-management trial, participants who received monetary rewards in exchange for negative
smoking tests were more likely to abstain from the habit than those who were paid but were not tested.
(Figure not reproduced; axis: weeks of sustained abstinence.)

But despite proven results, treatment 
schemes like this are rarely offered. “I think 
in some ways that relates to the controversial 
nature of it,” says Nancy Petry, an addiction
specialist at the University of Connecticut 
in Farmington. “You're providing tangible 
rewards to people who were engaging in illegal 
or unhealthy behaviours.” To help to change 
that perception, an array of scientists are working
on ways to improve contingency management
by tweaking how it is offered, in the hope
that it will finally cross the boundary to an 
acceptable and commonplace treatment. 


OFFSHOOT OF THE COCAINE EPIDEMIC 

Contingency management arose out of the 
work of psychologist B. F. Skinner, who 
challenged the idea that all behaviours stem 
from free will. Instead, through experiments 
on rodents that he began in the 1930s, Skinner 
showed that behaviours can be reinforced by 
systematic punishments or rewards. 

Addiction seemed to be a suitable disease 
on which to test Skinner’s ideas. Addiction is 
“a concrete behaviour, one that’s very difficult 
to manage, but can be very easily measured”,
says Maxine Stitzer, an addiction-treatment 
researcher at Johns Hopkins University in 
Baltimore, Maryland. 

Case studies dating to the 1960s and 1970s 
provided tantalizing evidence that financial
and other incentives might help people to 
quit drugs or lose weight. But the field really 
took off in the 1980s with the rise of the 
cocaine epidemic. Many pharmacological and 
psychosocial interventions failed, but finan- 
cial incentives dramatically increased cocaine 
abstinence.

In the 1970s, Stitzer, one of the first researchers
to study contingency management with
drug users, and her colleagues developed a
simple protocol for people who were addicted
to a substance: rewards for negative drug tests
and nothing for positive tests. She experimented
by providing people who were addicted
to opiates with special privileges, such as letting
them take methadone (a medication
used to control cravings) at home rather
than at a clinic. In one study, one group of
patients could automatically take methadone
home, but another group had to earn the 
privilege through abstinence from other drugs. 
Over a four-week period, Stitzer found that 
people who had to earn their privileges were 
four times more likely to abstain from drugs 
(32% abstinence versus 8%) than those who 
received automatic privileges. 

But researchers soon began to favour cash 
incentives. “Everybody likes money,” Stitzer
says. “And more importantly, you can vary 
the amount.” Also, unlike Stitzer’s metha- 
done approach, which was specific to people 
who were dependent on opiates, cash incen- 
tives can be applied to any addiction. In the 
1990s, Stephen Higgins, director of the centre 
that Anna visited in Vermont and a former 
postdoc in Stitzer’s lab, developed a financial 
incentive programme for cocaine addiction. 
Outpatients with negative test results received 
vouchers that could be redeemed for retail 
items. By giving vouchers instead of cash, says 
Higgins, patients can buy something meaning- 
ful without using the money for drugs. “What's 
a reward for you might differ a lot from what's 
a reward for me,” Higgins says. Like cash, 
“vouchers allow the individual to express 
what's rewarding for them”. 

By the 2000s, Stitzer had combined 
the methadone take-home privilege with
vouchers. In one experiment, she divided
cocaine users on a methadone treatment pro- 
gramme into three groups: one group received 
take-home methadone privileges for nega- 
tive test results; a second received the take- 
home privileges plus vouchers totalling up 
to $5,800 over 52 weeks; and a third followed 
the normal protocol of taking methadone at 
the clinic. Both sets of people with the take- 
home privileges submitted three times more 
drug-negative urine samples than the normal 
maintenance group over a one-year period. 
Furthermore, those who received vouchers 
showed longer periods of abstinence. 

A number of meta-analyses have shown
that incentives work considerably better than 
more conventional addiction-management 
programmes, particularly when vouchers 
are added to the protocol. What is more, says 
Higgins, vouchers are the only treatment so far 
that works for cocaine users. 

Contingency management is popular in 
helping people to stop smoking, an addic- 
tion that is seen as more amenable to low- 
intervention treatment than addictions to 
other drugs, such as heroin or cocaine. To give 
an example, in Scotland’s deprived Greater 
Glasgow region, 20% of women report smok- 
ing during pregnancy. David Tappin, who 
specialises in clinical trials for children at the 
University of Glasgow, recruited more than 600 
women, all of whom smoked. At their first pre- 
natal appointment, he randomly assigned half 
to the routine smoking-cessation programme 
run by the UK National Health Service (NHS), 
and the others to the service plus the possibil- 
ity of receiving up to £400 (US$610) in shop- 
ping vouchers. By the end of their pregnancies, 
Tappin found that just shy of 9% of women in 
the control group had stopped smoking, compared
with almost 23% in the incentive group.
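
The size of that effect can be made concrete with a small calculation. The sketch below takes the article's rounded figures ("just shy of 9%" and "almost 23%") as 0.09 and 0.23, which is an approximation, and derives the risk difference and the number of women who would need to be offered vouchers for one additional quitter. The function names are illustrative, and the cost figure is only a ceiling that assumes every woman offered vouchers earns the full £400.

```python
def absolute_difference(control_rate: float, incentive_rate: float) -> float:
    """Extra quitters per woman offered incentives (risk difference)."""
    return incentive_rate - control_rate


def number_needed_to_treat(control_rate: float, incentive_rate: float) -> float:
    """Women who must be offered incentives to gain one additional quitter."""
    difference = absolute_difference(control_rate, incentive_rate)
    if difference <= 0:
        raise ValueError("incentive arm must outperform the control arm")
    return 1.0 / difference


# Rounded from the article's wording; the exact trial values may differ.
CONTROL_QUIT_RATE = 0.09
INCENTIVE_QUIT_RATE = 0.23
MAX_VOUCHER_GBP = 400  # maximum voucher value per woman, from the article

nnt = number_needed_to_treat(CONTROL_QUIT_RATE, INCENTIVE_QUIT_RATE)
# Ceiling only: assumes every woman offered incentives earns the full amount.
voucher_cost_ceiling = MAX_VOUCHER_GBP * nnt

print(f"Risk difference: {absolute_difference(CONTROL_QUIT_RATE, INCENTIVE_QUIT_RATE):.2f}")
print(f"Number needed to treat: {nnt:.1f}")
print(f"Voucher cost ceiling per extra quitter: £{voucher_cost_ceiling:,.0f}")
```

On these rounded inputs, roughly seven women would need to be offered vouchers for each additional woman who quits.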


REAL-WORLD CHALLENGES 

But moving contingency management from 
the lab to the real world has proved to be chal- 
lenging. The problems are multifold. For one, 
asking participants to travel frequently to a 
drug-testing centre is often untenable. Even 
though Anna lives only 8 kilometres from the 
University of Vermont, getting to the lab took 
her one hour by bus. Then there is the issue 
of acceptability. To many taxpayers, giving 
money to people addicted to a drug seems baf- 
fling — or morally wrong. But the real hitch is: 
who should pick up the tab? 

Financial incentives make sense in closed 
health-care systems in which a single entity — 
such as the NHS — covers all of an individual’s 
health costs, says Petry. Although the initial 
price tag may seem high, contingency management
is cheaper than treating addiction with
more conventional programmes down the line. 
It can also thwart the sorts of chronic health 
problems that can arise from prolonged use of 
a given substance. 

Tappin estimates that it would cost the
NHS £400,000 to roll out a contingency-
management programme for all pregnant 
women who smoke in and around Glasgow. By 
comparison, he says, the NHS already spends 
£4.6 million per year on cholesterol-lowering 
statin drugs in the same region. Compared 
to other treatments or procedures already in 
place, “it’s quite a cheap intervention,” he says,
and cheaper still if it improves the long-term
health of mothers and their babies. 

In countries such as the United States, 
which lack a single-payer health-care system, 
contingency management can be more diffi- 
cult to implement. But it can work for specific 
groups. For instance, the Department of Vet- 
erans Affairs, which covers all the health needs 
of military veterans, adopted a contingency- 
management programme across its outpatient 
substance-abuse clinics in 2011 (Petry is 
an independent consultant for the depart- 
ment). Individuals with negative test results 
are rewarded with gift cards to the Veterans
Canteen Service, which sells numerous goods 
at a discount. 


GETTING CREATIVE 

Researchers have also started to experiment 
with alternative methods to overcome the cost 
and delivery barriers associated with giving 
every participant a high-value reward when 
he or she has a negative drug test. 

One way to reduce costs is to encourage peo- 
ple to buy into an incentive programme with 
their own money, a practice known as a deposit 
or commitment contract. That money then has 
to be earned back through negative urine tests.
In a study published this year, researchers
put the concept to the test with more than 
2,500 employees of CVS Caremark, a pharma- 
ceutical chain based in Woonsocket, Rhode 
Island, plus their relatives and friends who 
wanted to stop smoking. 

They divided those individuals into several 
groups, including one whose members could 
receive $800 if they abstained from smok- 
ing for six months and another group whose 
members had to pay $150 to participate. They 
could then earn back that $150 plus a $650 
reward if they did not smoke for 6 months. 

The researchers found that both the reward 
and deposit groups quit at higher rates than 
those receiving more conventional care, 
such as nicotine replacement therapies and 
counselling. Moreover, those who paid the 
$150 deposit were twice as likely to quit as
those receiving a straightforward reward. But 
there was a catch. Because participation was 
optional, 90% of those assigned to the rewards 
group agreed to give abstinence a try, compared 
with just 14% in the ‘penalty’ group. “The data 
are overwhelming in showing the limitation 
of deposit contracts. The vast majority of folks 
will not join them,” Higgins says. “The straight
incentive programme is the way to go.” 
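
Higgins's point can be checked with back-of-the-envelope arithmetic. The sketch below combines the two figures reported above (90% versus 14% uptake, and deposit participants being twice as likely to quit) under two assumptions that are not from the study: that the "twice as likely" comparison applies among people who agreed to take part, and that people who decline gain nothing from the programme. The 20% quit rate is a purely hypothetical placeholder; the final ratio does not depend on it.

```python
def programme_quit_rate(uptake: float, quit_rate_among_acceptors: float) -> float:
    """Share of everyone offered a programme who end up quitting through it."""
    return uptake * quit_rate_among_acceptors


REWARD_UPTAKE = 0.90             # from the article
DEPOSIT_UPTAKE = 0.14            # from the article
DEPOSIT_RELATIVE_SUCCESS = 2.0   # "twice as likely to quit", from the article

# Hypothetical placeholder, not a study result.
assumed_reward_quit_rate = 0.20

reward_overall = programme_quit_rate(REWARD_UPTAKE, assumed_reward_quit_rate)
deposit_overall = programme_quit_rate(
    DEPOSIT_UPTAKE, DEPOSIT_RELATIVE_SUCCESS * assumed_reward_quit_rate
)
# The placeholder cancels out: 0.90 / (0.14 * 2.0).
reward_advantage = reward_overall / deposit_overall

print(f"Reward programme, quitters per person offered:  {reward_overall:.3f}")
print(f"Deposit programme, quitters per person offered: {deposit_overall:.3f}")
print(f"Reward advantage: {reward_advantage:.1f}x")
```

Under these assumptions the straight reward yields about three times as many quitters per person offered, even though the deposit works better for the few who accept it.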

(Photo caption: Vouchers as a reward for a negative drug test.)

Other solutions have proved to be more
successful. For instance, the Department
of Veterans Affairs uses a ‘fishbowl’ model
developed by Petry in which participants with
negative test results are eligible to draw a slip
of paper from a bowl in much the same way as
a raffle. Rather than giving everyone the same
lump sum, prizes on the slips vary: half of the
slips carry no reward, others are for vouchers
ranging in value from $1 to the cost of a new
TV. The fishbowl model substantially reduces
costs, Petry says, with results rivalling those
seen with conventional voucher programmes.
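
To see why a prize draw costs less than a fixed voucher, consider a minimal sketch of the fishbowl idea. Only the overall shape comes from the description above: half the slips carry no reward and the rest range from $1 to one large prize. The slip counts, the $20 and $500 prize values, and the $10 fixed voucher used for comparison are all hypothetical.

```python
import random

# Hypothetical bowl: prize value in dollars -> number of slips.
# Half the slips (250 of 500) carry no reward, as in the description above.
BOWL = {0: 250, 1: 209, 20: 40, 500: 1}
FIXED_VOUCHER = 10  # hypothetical flat voucher per negative test, for comparison


def expected_prize(bowl: dict[int, int]) -> float:
    """Average payout per draw, weighted by the number of slips of each value."""
    total_slips = sum(bowl.values())
    return sum(value * count for value, count in bowl.items()) / total_slips


def draw(bowl: dict[int, int], rng: random.Random) -> int:
    """Draw one slip at random, with replacement."""
    values = list(bowl.keys())
    weights = list(bowl.values())
    return rng.choices(values, weights=weights, k=1)[0]


rng = random.Random(0)  # seeded so the illustration is repeatable
sample_draws = [draw(BOWL, rng) for _ in range(12)]

print(f"Expected cost per draw: ${expected_prize(BOWL):.2f}")
print(f"Fixed voucher per test: ${FIXED_VOUCHER:.2f}")
print(f"Twelve sample draws: {sample_draws}")
```

With this made-up bowl the average payout is about $3 per negative test, against $10 for the flat voucher, while every draw still carries a chance of the large prize.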

Researchers at Johns Hopkins have
developed the ‘therapeutic workplace’, in which
jobseekers who abuse substances and present
negative drug-test results are rewarded
with the opportunity to work and earn wages.

Others are looking at ways to ease the
delivery of contingency management.
Jesse Dallery, a behaviour analyst at the University
of Florida in Gainesville, has developed
a model in which people who smoke are sent a 
carbon monoxide testing kit in the post. Using 
a web camera or smartphone, participants 
video themselves blowing into the machine 
and holding up the value. Negative results 
are rewarded with vouchers, or by return- 
ing a cash deposit made by the participant 
(see ‘Smoke free’). 

Dallery’s former graduate student Bethany 
Raiff, a behaviour analyst at Rowan Univer- 
sity in Glassboro, New Jersey, is now looking to 
couple that online test with video games — one 
for smartphones and one for Facebook — in 
which people who smoke earn rewards in the 
game instead of money or vouchers. She has 
partnered with video-game designers who 
specialize in making health-focused products.
“You have to make the games fun and
addictive,” Raiff says, and then corrects her
choice of word, “but not in a bad way”.

Each of Raiff’s games lasts about five min- 
utes — the time it takes to smoke a cigarette. 
To stop people smoking while playing, Raiff 
and her colleagues are incorporating preven- 
tative strategies, such as two-handed playing. 
In the long term, Raiff would like to work
with insurance companies who have a vested 
interest in keeping clients healthy. Some, she 
notes, already cover preventative care, such as 
gym memberships. 


BACK IN VERMONT 

Although contingency management is more 
effective than conventional treatments, similar 
issues arise in both approaches — patients 
struggle to stay abstinent. Bolstered by his 
short-term success in getting people who have 
become dependent on cocaine to quit, Higgins 
began searching for a population in which 
short-term changes could produce substantial 
benefits. 

Twelve years ago, Higgins began his research 
into helping pregnant women who smoke to 
end their habit. Even short-term abstinence, 
he reasoned, could dramatically improve their 
babies’ health. So far, his research has shown 
that women in the incentive group carry larger 
fetuses, as measured by ultrasound scans at 
30 and 34 weeks, and give birth to larger babies. 
Now, he is offering incentives to women to 
continue to abstain from smoking for three 
months after they give birth, and is consider- 
ing extending that time frame. His research 
indicates that providing a reward after childbirth
helps women to breastfeed for longer and
decreases the risk of postnatal depression. 

Back at the centre, Anna finds out that she is 
in treatment A and will receive compensation 
for her time but no extra incentives. “There
are benefits to both,” Allison Kurti, the postdoc
running the study, tells her. “If you've got
a real busy schedule, it’s kind of nice if we don't
have to see you as often. Some people find that
more convenient.” Convenient, perhaps, but
as effective? Anna’s not so sure. Just as Skinner
observed in rodents so many decades ago,
rewards and reinforcement really are key. “I
know if someone’s on my ass,” Anna says, “I’m
more apt to do it.”


Sujata Gupta is a freelance science writer in 
Burlington, Vermont.


TECHNOLOGY


Barriers to misuse 


Ingenious pill formulations and the latest manufacturing 
technologies are helping to stem the tide of painkiller addiction. 


BY ELIE DOLGIN 


Mary Marcuccio’s life was turned upside
down by drug misuse and addiction.
Her son, now 26, started with alcohol
and marijuana. Then came cocaine and hallu- 
cinogens. By 14, he was stealing prescription 
painkillers from friends’ medicine cabinets, 
crushing and snorting the pills to achieve a 
quick and euphoric high. Within one year, he 
had graduated to injecting heroin. 

This progression is “so stereotypical”, says
Marcuccio, founder of My Bottom Line, a 
Florida-based consulting business for families 
dealing with substance misuse. According to 
US survey data, 77% of heroin users say that, 
like Marcuccio’s son (who remains addicted 
to heroin), they misused prescription opioids 
— derivatives of natural or synthetic forms of 
opium or morphine — before trying heroin. 

But substance-misuse specialists think that 
this chain of addiction might be broken with 
the aid of the latest manufacturing processes to
make powerful opioid pain medication more
resistant to various forms of tampering. Such 
drug preparations could also save lives. The 
death toll from misusing prescription opioids 
has skyrocketed around the world in the past 
20 years, with opioid-linked overdoses exceed- 
ing fatalities from road accidents or deaths 
from heroin and cocaine in countries includ- 
ing the United Kingdom, the United States and 
Australia. “It behooves us to make a greater 
effort at creating unabusable formularies,” 
Marcuccio says. 

Fortunately, the science and manufacturing 
of misuse-deterrence are advancing rapidly 
— and so is the political climate. In the United 
States — a country that consumes more than 
80% of the global opioid supply — politicians 
are beginning to craft bills to incentivize the 
development of misuse-resistant formulations. 
“The idea is to transition the market,” says Dan
Cohen, chair of the Abuse Deterrent Coalition, 
a network of advocacy organizations, technology
manufacturers and drug companies based
in Washington DC. “There are now so many
different abuse-deterrent formulations that are 
either in products or in development that there's 
enough variety out there for any product to be 
able to put abuse-deterrence in it.”


THE NEW GUARD 

Some of the latest tablet formulations are so 
hard that even a hammer-blow cannot pulver- 
ize them. Many pills form a gelatinous goo when 
dissolved that renders them difficult to inject. 
Others contain reversal agents that negate the 
high when the tablets are messed with. The idea 
is to create pain-relief medicines that are less 
prone to misuse yet work when taken as directed. 

The technologies in place today are not iron- 
clad, though. A quick perusal of online mes- 
sage boards and videos reveals numerous tips 
on how to circumvent the defences of even the 
most reinforced tablets. What is more, not all 
prescription opioids on the market are misuse-resistant.
“We're still in abuse-deterrent formulations
1.0,” says Richard Dart, director of the
Rocky Mountain Poison and Drug Center in 
Denver, Colorado. But, he adds with a touch of 
hyperbole, “there are a zillion abuse-deterrent 
formulations coming.”

Manufacturers have been worried about 
prescription-drug misuse for decades. When 
the first controlled-release formulation of the 
opioid oxycodone hit the US market 20 years 
ago, the drug’s manufacturer, Purdue Pharma 
of Stamford, Connecticut, touted the twice- 
a-day medicine as a less-addictive alternative 
to the faster-acting painkillers that provide a 
big opioid hit all at once. In reality, however, 
Purdue's longer-lasting pill, sold under the trade 
name OxyContin, had the opposite effect. 

Drug users easily defeated OxyContin’s 
time-release mechanism by crushing or chew- 
ing it. Just one OxyContin could contain more 
oxycodone than a dozen instant-release pills 
but no extra ingredients such as paracetamol 
that make people sick if taken at high doses. 
OxyContin quickly became the number one 
addiction problem in many parts of the world, 
particularly in the United States and Australia. 
The drug was so popular among the rural poor 
of Appalachia in West Virginia and Kentucky 
that it earned the street name ‘hillbilly heroin’.

Purdue set to work to guard against some of 
the worst forms of misuse. In 2010, the com- 
pany introduced a misuse-averting version 
of OxyContin that contains a polymer made 
of long-chain molecules. This makes the new 
tablet more difficult to crush — although it is 
not rock hard. “It behaves more like plastic,”
explains Richard Mannion, executive director 
of pharmaceutics and analytical development 
at Purdue. “So, it will deform if subjected to 
force, but it doesn't break into a powder easily.” 
The revised formulation is thus much harder to 
snort. Plus, Mannion says, when combined with 
water, the polymer forms a gummy substance 
that makes it very difficult to draw into a syringe 
(although misuse is still possible). 




The new version of OxyContin has proved 
to reduce the incidence of therapeutic misuse. 
A study of more than 140,000 people treated at
rehabilitation centres across the United States 
found that misuse by injection, snorting or 
smoking declined by two-thirds in the two years 
after the reformulation. In light of these results, 
in 2013, Purdue won the right from the US Food 
and Drug Administration (FDA) to describe 
the misuse-deterrent benefits of OxyContin on 
the drug's label and to make marketing claims 
accordingly. The FDA said at the time that any 
future generic versions of OxyContin would 
have to incorporate equivalent misuse-deterrent 
protection. (In April 2015, the FDA released a 
guidance document outlining the types of study 
needed to establish misuse-deterrence, but the 
report stopped short of addressing generic 
opioid products.) 

Other painkillers that now have FDA- 
approved misuse-deterrent labelling include 
Embeda, an extended-release morphine from 
New York-based pharmaceutical firm Pfizer, 
and Targiniq, another long-acting prepara- 
tion of oxycodone from Purdue. Both contain 
antagonist agents — offsetting ingredients that 
remain largely inactive when the drugs are 
taken as directed, but that will annul the opi- 
oid’s effects if the drugs are snorted or injected. 

“These new technologies are showing some 
positive results,” notes Robert Jamison, a pain 
psychologist at the Brigham and Women’s 
Hospital Pain Management Center in Chestnut 
Hill, Massachusetts. In Australia, for example, 
OxyContin users accounted for more than 60% 
of the visits to the Medically Supervised Inject- 
ing Centre in Sydney. After the tamper-resistant 
version of OxyContin hit the Australian market 
in April 2014, a team led by Louisa Degenhardt, 
a drug-addiction researcher at the University 
of New South Wales in Sydney, found that the
number dropped to 5%. In the United States, 
levels of opioid misuse have decreased from 
their peak in 2010, when the new formulation 
of OxyContin arrived on the market. Rates of 
opioid dispensing and overdoses have dropped 
appreciably, too. 

These public-health benefits come with 
an economic bonus. According to calcula- 
tions from Noam Kirson and his colleagues at 
Analysis Group, a consulting firm in Boston, 
Massachusetts, the reformulated OxyContin 
has reduced misuse-related medical expenses 
and indirect societal costs by more than 
US$1 billion per year in the United States. 
“These are substantial savings,” Kirson says. 


OLD HABITS DIE HARD 

Despite the gains, the misuse-deterrence field 
still has a long way to go. Drug users who have 
been thwarted by one technology can switch to 
another prescription medicine that lacks anti- 
tampering defences. That is what happened in 
rural Appalachia following the introduction 
of reformulated OxyContin. Opioid misusers 
simply started snorting and injecting the less 
potent immediate-release preparations of oxycodone, 
most of which lack misuse-deterrence 
characteristics. “It's kind of a whack-a-mole 
situation,” says Jennifer Havens, an epidemiologist 
at the University of Kentucky Center 
for Drug and Alcohol Research in Lexington. 
Plus, even with the latest physical defences it 
is still possible to get high by swallowing lots of 
OxyContin or Embeda pills at once. Preventing 
oral misuse requires a different approach, one 
that a company called Signature Therapeutics, 
based in Palo Alto, California, is pursuing. 

Signature Therapeutics’ technology uses 
prodrugs, which are inactive until they undergo 
the appropriate chemical conversion in the 
body. When these pills are taken by mouth as 
directed, a digestive enzyme in the gut called 
trypsin releases part of the prodrug, initiating 
the process of opioid drug release. But because 
trypsin is not found elsewhere in the body, the 
prodrug remains inert when injected, snorted 
or smoked. Signature Therapeutics has already 
tested its painkilling hydromorphone prodrug 
in a phase I trial of healthy volunteers; the 
company plans to begin evaluating its oxyco- 
done prodrug in human studies later this year. 

Prodrugs alone do not prevent excessive 
pill-popping, but scientists at Signature 
Therapeutics have another trick up their 
sleeves. If the prodrugs look promising in 
the clinic, the company will add a second 
compound that blocks trypsin activity. This 
might seem counterintuitive, but it is all about 
threshold levels. The amount of trypsin inhibi- 
tor found in one or two pills will not interfere 
with the prodrug modification, but a handful 
of pills collectively contain enough inhibitor to 
shut down the conversion process. With this 
approach, Signature Therapeutics can create 
either extended-release or immediate-release 
opioids. Bill Schmidt, chief medical officer at the 
company, says that the potential of these drugs 
is “maximum therapeutic benefit with very low 
abuse liability”. 
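
The threshold idea can be made concrete with a toy calculation. The sketch below is purely illustrative and is not Signature Therapeutics' model: the function names, the arbitrary dose units, and the assumption that inhibitor from up to two pills has no effect while five pills' worth shuts conversion down completely are all hypothetical values chosen to show the shape of the effect.

```python
def trypsin_activity(pills, tolerated_pills=2, shutoff_pills=5):
    """Fraction of trypsin activity left after swallowing `pills` at once.

    Hypothetical model: the inhibitor in up to `tolerated_pills` has no
    effect; beyond that, activity falls linearly and reaches zero at
    `shutoff_pills`.
    """
    if pills <= tolerated_pills:
        return 1.0
    fraction_lost = (pills - tolerated_pills) / (shutoff_pills - tolerated_pills)
    return max(0.0, 1.0 - fraction_lost)


def released_opioid(pills, dose_per_pill=1.0):
    """Opioid released (arbitrary units) when `pills` are swallowed together."""
    return pills * dose_per_pill * trypsin_activity(pills)


if __name__ == "__main__":
    for n in range(1, 7):
        print(n, "pills ->", round(released_opioid(n), 2), "units released")
```

Under these assumed numbers, the amount of opioid released rises with the first two pills and then falls, reaching zero once enough inhibitor has been swallowed. Real formulations would depend on measured enzyme kinetics rather than a linear ramp.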


The original OxyContin pill (left) could be crushed and snorted; the new form (right) is tamper-resistant. 

New formulations such as these could 
ultimately prove to be almost addiction-proof, 
but they are not cheap. And their benefits might 
not be fully realized unless authorities require 
drug companies to include them. “The problem 
with abuse-deterrence right now is the lack of 
incentives,” Cohen says. 

Lawmakers in the US House of Representa- 
tives previously proposed legislation that would 
have barred the approval of any new pharma- 
ceuticals that did not use formulas resistant to 
tampering. That bill died in committee, but, 
according to Cohen, revised legislation should 
be introduced again “soon”. Individual US states 
have also begun to pass laws that compel phar- 
macists exclusively to dispense, and insurers 
to cover, misuse-deterrent versions of opioids 
unless instructed otherwise by a physician. 

Ultimately, the success of long-term efforts 
to rein in opioid addiction could depend on the 
regulations surrounding generic painkillers. 
In December 2014, Australia allowed the sale 
of a generic long-acting oxycodone without 
misuse-deterrence characteristics. Degenhardt, 
who is monitoring the drug-misuse data, 
worries that many of the gains of OxyContin’s 
reformulation will now be lost. By contrast, US 
authorities have already said that they will not 
approve such a product. 

All of these efforts should help to bring down 
the number of overdose deaths and also prevent 
experimentation with prescription pills. In her 
study population in rural Appalachia, Havens 
has met so many young people like Marcuccio’s 
son — for whom easily misused opioids were 
the gateway to addiction — that she has reached 
a simple, but absolute, conclusion: “The only 
way that abuse-deterrent formulations are 
going to work is if they're all abuse-deterring,” 
she says. “It can't just be piecemeal. It’s got to be 
all or nothing.” 


Elie Dolgin is a science writer in Somerville, 
Massachusetts. 
PERSPECTIVE 

Behavioural addictions matter 

More research, and dedicated funding, is needed to understand and 
successfully treat compulsive habits, says Marc Potenza. 


What behaviours can be considered addictions? Gambling, 
Internet use, sex, shopping and eating can become 
excessive, but whether they should be labelled as addictions 
is an ongoing debate. 

In the most recent, fifth edition of the Diagnostic and Statistical 
Manual of Mental Disorders (DSM-5), a book published by the 
American Psychiatric Association in 2013 that defines and classifies 
mental health conditions, gambling disorder was moved from 
its category of ‘Impulse-control disorders not elsewhere classified’ 
to ‘Substance-related and addictive disorders’. This represents a 
significant shift from a view that has prevailed since the 1980s that 
addictions are disorders involving compulsive drug use, and multiple 
non-substance-related behaviours may now be 
considered addictions. 

Gambling disorder is currently the only non-substance 
condition that is listed as an addiction 
in the DSM-5, although a work group proposed 
that Internet gaming disorder (IGD) warrants 
additional research. Multiple aspects of IGD 
remain controversial, including to what extent 
the Internet may be the vehicle versus the focus 
of a disorder, and, if a broader ‘Internet-use disorder’ 
is to be accepted, the extent to which use 
represents an addiction. The work group focused 
on gaming because it was the most well studied 
and arguably problematic form of Internet use 
at the time, but behaviours such as social networking 
and pornography viewing are also under 
scrutiny. Such uses of the Internet also appear 
clinically relevant: problematic online social 
networking, for example, has been linked to poor emotional regulation 
and problems with alcohol use among university students. Given 
that more people are growing up with digital technology, considering 
a broader range of Internet-related activities as potentially addictive 
seems important for addiction researchers. 


DEFINING BIZARRE BEHAVIOUR 

But even if such diagnoses were to be accepted, the question of where to 
draw the line between abnormal and normal behaviours is still up for 
debate and has contributed to wide variations in prevalence estimates 
for problematic Internet use. Currently, the DSM-5 uses a more 
stringent threshold for diagnosing gambling disorder (it must meet 
4 or more inclusionary criteria out of 9) or Internet gaming disorder 
(5 or more out of 10) than it does for diagnosing substance-use disorders 
(2 or more inclusionary criteria out of 11); we must take care not 
to underestimate how widespread such non-substance behaviours are 
and the negative impact that they can have on public health. 

Another controversial topic is sex addiction. Formal criteria for 
hypersexual disorder have been proposed and tested, but the condition 
was not included in the DSM-5. As with the other behavioural 
addictions, debate exists about where to set the threshold between normal 
and abnormal levels of sexual activity. Nevertheless, similarities in 
cognitive and biological changes involving craving and reward circuitry 
have been noted between compulsive sexual behaviours and substance 
and gambling addictions, and scales assessing addiction-like features 
such as craving seem relevant to aspects of sexual behaviours. A better 
understanding of aetiological and associated factors, such as to what 
extent psychological and biological determinants linked to gambling 
and substance addictions also relate to hypersexuality, should help clas- 
sification efforts and promote the development of targeted treatments. 

Other behaviours, including excessive eating and shopping, are also 
sometimes considered addictions. Of note, patients receiving dopa- 
mine-boosting treatment for Parkinson’s disease have sometimes devel- 
oped excessive eating, shopping, sex and gambling habits, suggesting 
that there may be a biological link that drives all of these behaviours. 
But there are nuances: although obesity has been 
found to share biological features with substance 
addictions, the variety of ways that the condition 
manifests itself suggests that only a subset of indi- 
viduals with obesity may be characterized by food 
addiction. In particular, individuals with binge- 
eating disorder are likely to meet food-addiction 
criteria, suggesting similarities with gambling 
disorder and substance-use disorders. If foods 
are demonstrated to have addictive potential, it 
would be important to identify the specific foods 
or food components and implement relevant 
public-health policies and interventions. 

While experts debate which non-substance 
disorders may constitute addictions, people 
continue to exhibit problematic behaviours. 
Thus, more research is needed to better under- 
stand the epidemiological, clinical, neurobiologi- 
cal, genetic and cultural features to help prevent and treat behavioural 
addictions. Research was paramount in compiling the DSM-5, and 
a similar process should be used in writing the 11th edition (due in 
2017) of the World Health Organization's International Classification 
of Diseases. But for this to happen, funding agencies must prioritize 
research into non-substance addictions. In the United States, the 
National Institutes of Health includes departments focusing on drugs 
and alcohol, but none that target behavioural addictions. The creation 
of a national institute on behavioural addictions would help to advance 
research in this area. In France, the government requires addiction 
treatment centres to provide care for people with behavioural 
addictions. Thus, how we classify these behaviours has direct clinical 
implications, and there is an important need to understand how best 
to prevent behavioural addictions and help people who experience 
harm related to them. 


Marc Potenza is director of the Center of Excellence in 
Gambling Research at Yale University in New Haven, Connecticut. 
e-mail: marc.potenza@yale.edu 
Research into addiction explores many aspects of how and why this 
disease develops. Here are four of the toughest questions. 

BY DAVID HOLMES 


How broadly should we define addiction? 

We often think about addiction in connection with the misuse 
of substances such as alcohol, heroin and cocaine. But should 
compulsive behaviours that involve sex, video games and 
gambling also be treated as addictions? 

Gambling is the only officially recognized behavioural addiction. 
But there is increasing evidence that other behaviours resemble 
substance misuse in their underlying neurobiology, and 
the way in which they respond to treatment (see page S62). 

Researchers want to move from a system that classifies 
addiction according to clinical symptoms to one more rooted 
in a mechanistic understanding of the disease, with a greater 
role for genetic imaging and cognitive science. 


How is addiction affected by our genes? 

A better understanding of the genetic factors that influence the risk 
of addiction could revolutionize the way that addiction is 
diagnosed, treated and prevented. 

Numerous adoption and twin studies have shown that the 
risk of addiction is about 50% heritable. But identifying the 
genetic factors and the extent to which they interact with each 
other and the environment has been more tricky (S48). 

Researchers are looking in ever-finer detail at potential 
sources of genetic variations (such as gene copy-number 
or the existence of rare genes) and epigenetic 
changes in the way that genes are expressed. 


If addiction rewires the brain, can we short the circuit? 

Scientists know that the use of addictive substances causes 
physical changes in the brain that can lead to addiction. 
What they are just beginning to understand is how those 
changes can be reversed to treat the disease. 

Using optogenetic-guided brain surgery in mice, researchers 
have been able to identify a type of dopamine receptor that 
seems to have a crucial role in addiction. Blocking this receptor 
has reversed the symptoms of cocaine addiction in mice (S50). 

For practical and ethical reasons, optogenetic methods 
cannot be used in humans, but their increasing use in the 
lab could speed the discovery of drugs to target and reset 
the reward circuits that are overloaded in addiction. 


Can we get from the 12-step recovery programme to one shot? 

One of the biggest hurdles in treating addiction is preventing 
relapse: people can fall off the wagon after years of abstinence. 
A vaccine that neutralizes a substance before it reaches the 
brain could prevent people from returning to old habits. 

Researchers have been developing anti-drug vaccines 
for more than two decades and have a candidate that tricks rats’ 
immune systems into thinking injected heroin is a pathogen, so 
the drug is quickly neutralized before it reaches the brain (S53). 

Without a trial of an effective vaccine in humans, concerns 
over whether a vaccine could lead to drug users taking 
ever-higher doses cannot be addressed, but few, if any, 
pharmaceutical companies look likely to stump up the cash. 


David Holmes is a science journalist based in London, UK. 